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BACKGROUND OF THE INVENTION 



The present application claims the benefit of U.S. Provisional Application Serial 
Number 60/137,665 filed on June 4, 1999. The entire text of the above-referenced 
5 disclosure is herein incorporated by reference. 

A. Field of the Invention 

The present invention relates generally to the fields of biochemistry, molecular 
biology, and virology. More particularly, it relates to the identification of 259 
10 nucleotides of previously unrecognized sequence located at the 3' end of the GB virus B 
(GBV-B) genome. 

B. Description of Related Art 

Chronic hepatitis C is a major threat to the public health. Serologic surveys 
15 suggest that as many as 3.9 million Americans are chronically infected with the 
responsible virus, hepatitis C virus (HCV) (Alter, 1997). These individuals are at 
increased risk of developing progressive hepatic fibrosis leading to cirrhosis and loss of 
hepatocellular function, as well as hepatocellular carcinoma. The course of chronic 
hepatitis C is typically lengthy, often extending over decades, with insidious clinical 
20 progression usually occurring in the absence of symptoms. Nonetheless, liver disease due 
to HCV results in the death of 8,000-10,000 Americans annually, and chronic hepatitis C 
is the most common cause of liver transplantation within the U.S. 

Therefore, HCV is a major public health problem. However, therapy for chronic 
25 hepatitis C is problematic. Recombinant interferon-ot is approved for treatment of 
chronic hepatitis C (Consensus Development Panel, 1997). The benefit of interferon-a 
results primarily from its antiviral properties and its ability to inhibit production of virus 
by infected hepatocytes (Neumann et ai 9 1998). Nonetheless, even under optimal 
therapeutic regimens, the majority of patients with chronic hepatitis C fail to eliminate 
30 the virus or resolve their liver disease. Treatment failures are especially common in 

-2- 

256261.1 



persons infected with genotype 1 HCV, unfortunately the most prevalent genotype in the 
U.S. Thus, there is an urgent need to better understand the virus and develop better 
treatment. Unfortunately, technical difficulties in working with HCV have made it 
necessary to use infectious surrogate viruses in efforts to develop treatments and vaccines 
for HCV. 

Scientists' efforts to better understand HCV and to develop new drugs for 
treatment of hepatitis C have been stymied by two overwhelming technical deficiencies: 
first, the nonexistence of a high permissive cell line that supports replication of the virus 
and second, the absence of a permissive animal species other than chimpanzees, which 
are endangered and therefore available on a limited basis. 

Presently, those who are working on HCV treatment and prevention are 
employing an infectious chimeric virus of sindbis and HCV and/or an infectious clone of 
pestiviruses as surrogate virus models in HCV drug discovery efforts, due to the above 
technical difficulties of working with HCV. Alternatively, they are using isolated 
proteins or RNA segments of HCV for biochemical and structural studies. This approach 
precludes functional studies of virus replication and its inhibition. 

GBV-B is a hepatotropic flavivirus that has a unique phylogenetic relationship to 
human HCV and strong potential to serve as a surrogate virus in drug discovery efforts 
related to hepatitis C antiviral drug development. GBV-B causes acute hepatitis in 
experimentally infected tamarins (Simons et al, 1995; Schlauder et ai 9 1995; Karayiannis 
et aL, 1989) and can serve as a surrogate virus for HCV in drug discovery efforts (due to 
technical difficulties in working with HCV). GBV-B virus is much closer in sequence 
and biological properties than the above-described models. It will be easier to make 
biologically relevant chimeras between HCV and GBV-B than by using more distantly 
related viruses. GBV-B is hepatotropic (as is HCV), whereas the viruses used in these 
competing technologies are not. In view of the above, an infectious clone of GBV-B 
would be useful to those working on HCV treatment and prevention. 
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Unfortunately, the use of GB V-B as a surrogate or model for HC V has not been 
possible in the past, because no infectious molecular clone of GBV-B virus genome could 
be prepared. It is now known that this obstacle was encountered because the GBV-B 
5 genome was believed to be 259 nucleotides shorter than its actual length (Muerhoff et al, 
1995; Simons et aL, 1995). Others, previous to the inventors, had failed to realize that 
the 3 ! sequence of GBV-B was missing from the prior sequences. Without this 3' 
sequence, it is not possible to prepare an.infectious GBV-B molecular clone. 



10 SUMMARY OF THE INVENTION 

As discussed above, an infectious molecular clone of GBV-B would be very 
useful for the development of HCV preventative and therapeutic treatments. The 
construction of an infectious molecular clone of this virus will require the newly 
determined 3 f sequence to be included in order for the clone to be viable. The inventors 
15 have elucidated the previously unrecognized 3 f terminal sequence of GBV-B (SEQ ID 
NO:l). This sequence has been reproducibly recovered from tamarin serum containing 
GBV-B RNA, in RT-PCR protocols using several different primer sets, and as a fusion 
with previously reported 5 f GBV-B sequences. 

20 The newly identified 3' sequence is not included in published reports of the 

GBV-B sequence, nor described in patents relating to the original identification of the 
viral sequence (see U.S. Patent No. 5,807,670 and references therein). 

The invention has utility in that the inclusion of the sequence will be necessary for 
25 construction of an infectious molecular GBV-B clone. Such clones clearly have the 
potential to be constructed as chimeras including relevant hepatitis C virus sequences in 
lieu of the homologous GBV-B sequence, providing unique tools for drug discovery 
efforts. A full-length molecular clone of GBV-was constructed, as described in later 
sections of this specification. 
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GBV-B can be used as a model for HCV, and the GB V-B genome can be used as 
the acceptor molecule in the construction of chimeric viral RNAs containing sequences of 
both HCV and GBV-B. Such studies will allow one to investigate the mechanisms for 
5 the different biological properties of these viruses and to discover and investigate 
potential inhibitors of specific HCV activities (e.g., proteinase) required for HCV 
replication. However, all this work is dependent upon construction of an infectious clone 
of GBV-B, which is itself dependent on the incorporation of the correct 3' terminal 
nucleotide sequence within this clone. GBV-B has unique advantages over HCV in terms 
10 of its ability to replicate and cause liver disease in tamarins, which present fewer 
restrictions to research than chimpanzees, the only nonhuman primate species known to 
□ be permissive for HCV. 

W 5 

03 An infectious molecular clone of GBV-B is expected to have utility in liver- 

Ql 15 specific gene expression or in gene therapy. This application might be enhanced by the 

s i n 

r.I inclusion of HCV genomic sequence in the form of a GBV-B/HCV chimera. Further, an 

;L infectious GBV-B/HCV chimera expressing HCV envelope proteins can have utility as a 

Ql vaccine immunogen for hepatitis C. 

y 20 A full-length cDNA copy of the GBV-B genome was constructed to contain the 

newly identified 3* terminal sequences. RNA transcribed from this cDNA copy of the 
genome would be infectious when inoculated into the liver of a GBV-B permissive 
tamarin, giving rise to rescued GBV-B virus particles. A chimeric molecule would then 
be constructed from this infectious GBV-B clone in which the HCV NS3 proteinase or 

25 proteinase/helicase sequence (or other relevant HCV sequences of interest in drug 
discovery efforts) would be placed in frame in lieu of the homologous GBV-B sequence, 
and this chimeric cDNA would be used to generate infectious GBV-B/HCV chimeric 
viruses by intrahepatic inoculation of synthetic RNA in tamarins. Published studies 
indicate that the GBV-B and HCV proteinases have closely related substrate recognition 

30 and cleavage properties, making such chimeras highly likely to be viable. These newly 
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generated chimeric GB V-B/HC V viruses could be used in preclinical testing of candidate 
HCV NS3 proteinase inhibitors. 



10 



15 



Therefore, the present invention encompasses an isolated polynucleotide encoding 
a 3' sequence of the GBV-B genome. The polynucleotide may include the sequence 
identified as SEQ ID NoilT^It is contemplated that the polynucleotide may be a DNA 
molecule or it can be an RNA molecule. It is further contemplated that expression 
constructs may contain a polynucleotide that has a stretch of contiguous nucleotides from 
SEQ ID NO:J^^o^SEQ ID NO:2, for example, lengths of 50, 100, 150, 250, 500, 
r000,'5000, as well as the entire length of SEQ ID NO:l^or 2, are considered appropriate. 
Such polynucleotides may also be contained in other constructs of the invention or be 
used in the method s of th e invention. Polynucleotides employing sequences from SEQ 
ID NO:l may^imativc^^ontain sequences from SEQ ID NO:2 in^the constructs and 
methods of the ^sSifinvention. 



The invention is also understood as covering a viral expression construct that 
includes a polynucleotide encoding a 3' sequence of the GBV-B genome. This 
expression construct is further understood to contain the sequence identified as SEQ ID 
f NO:l. J'he present invention contemplates the expression construct as a plasmid or as a 
20 virusT Furthermore, the expression construct can express GBV-B sequences; alternatively 
it may express sequences from a chimeric GBV-B/HCV virus. 



The identification and isolation of a 3' sequence of GBV-B additionally provides 
a method of producing a virus, particularly a full-length virus, by introducing into a host 
25 cell a viral expression construct containing a polynucleotide encoding a 3' sequence of 
GBV-B and by culturing the host cell under conditions permitting production of a virus 
from the construct. This method can be practiced using a prokaryotic cell as a host cell, 
or by using a eukaryotic cell as a host cell. Furthermore, the eukaryotic cell can be 
located within an animal. 
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A method of producing virus according to the claimed invention can also be 
employed using a polynucleotide that contains synthetic RNA and/or synthetic DNA. 
Moreover, a step can be added to the method by also isolating any virus produced from 
the host cell. The virus can then be purified to homogeneity. 

5 

In further embodiments, the present invention encompasses an oligonucleotide 
between about 10 and about 259 consecutive bases of SEQ ID NO:l. This 
oligonucleotide is contemplated to be about 15 bases in length, about 20 bases in length, 
about 25 bases in length, about 30 bases in length, about 35 bases inTength, about 50 
10 bases in length, about 100 bases in length, about 150 bases in length, about 200 bases in 
length, or about 259 bases in length. 

Additional examples of the claimed invention include a method for identifying a 
compound active against a viral infection by providing a virus expressed from a viral 
15 construct containing a 3 f sequence of a GBV-B virus, by contacting the virus with a 
candidate substance; and by comparing the infectious ability of the virus in the presence 
of the candidate substance with the infectious ability of the virus in a similar system in 
the absence of the candidate substance. It is contemplated that the invention can be 
practiced using GBV-B virus or a GBV-B/HCV chimera. 

20 

The present invention can also be understood to provide a compound active 
against a viral infection identified by providing a virus expressed from a viral construct 
containing a 3 1 sequence of a GBV-B virus; contacting the virus with a candidate 
substance; and comparing the infectious ability of the virus in the presence of the 
25 candidate substance with the infectious ability of the virus in a similar system in the 
absence of the candidate substance. In some embodiments an active compound is 
identified using a GBV-B virus, while in other embodiments an active compound is 
identified using a GBV-B/HCV chimera. 
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in 



As used herein the specification, "a" or "an" may mean one or more. As used 
herein in the claim(s), when used in conjunction with the word "comprising," the words 
"a" or "an" may mean one or more than one. As used herein "another" may mean at least 
a second or more. 

5 

Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, that 
the detailed description and the specific examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only, since various 
10 changes and modifications within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to 
15 further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

FIG. 1. Predicted secondary RNA structure at the 3 5 end of the novel 3' GBV-B 
20 sequence. The structure shown is that predicted for the 3' terminal 108 nucleotides of the 
GBV-B genomic RNA by the MFOLD 3.0 computer program of Zucker and Turner, and 
has an initial free energy of -48 kcal/mole. The predicted structure contains 3 stem-loops, 
numbered from the 3' end of the genome and labeled SL-1, SL-2, and SL-3. The 
structure of SL-1 is highly probable, given its terminal position within the genome. 
25 Alternative foldings of SL-2 and SL-3 are possible. 
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DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 



A. GVB-B Virus 

The GBV-B genome structure is very similar to hepatitis C and these viruses 
share approximately 25% nucleotide identity (Simons et aL, 1995; Muerhoff et aL, 1995). 
5 As indicated above, this makes GBV-B more closely related to HCV than any other 
known virus. GBV-B genomic RNA is about 9.5 kb in length (Muerhoff et aL, 1995) 

with a structured 5' noncoding region that contains an IRES that shares many structural 

features with the HCV IRES (Honda et aL, 1996; Rijnbrand et aL, 1999). As in HCV, 
this IRES drives the cap-independent translation of a long open reading frame. The 

10 polyprotein expressed from this reading frame appears to be organized identically to that 
of HCV, and processed to generate proteins with functions similar to those of HCV 
(Muerhoff et aL, 1995). In fact, the major serine proteinases of these viruses (NS3) have 
been shown to have similar cleavage specificities (Scarselli et aL, 1997). Finally, like 
HCV and distinct from the pestiviruses, the genomic RNA of GBV-B has a poly(U) tract 

15 located near its 3' terminus (Simons et aL, 1995; Muerhoff et aL, 1995). In addition, 
unreported sequences located at the extreme 3' end of the genome have been identified. 
This work indicates that the GBV-B RNA, like that of HCV (HCV (Tanaka et aL, 1995; 
Kolykhalov et aL, 1996), terminates in a lengthy run of heterogeneous bases (310 nts in 
GBV-B) possessing a readily apparent secondary structure 

20 B. Nucleic Acids 

The present invention provides a nucleic acid sequence encoding a 3 1 sequence of 
the GBV-B genome (SEQ ID NO:l). 

It should be clear that the present invention is not limited to the specific nucleic 
25 acids disclosed herein. As discussed below, a "3' sequence of the GBV-B genome" may 
contain a variety of different bases and yet still be functionally indistinguishable from the 
sequences disclosed herein. Such functionally indistinguishable sequences are likely to 
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maintain the basic structure depicted in FIGs. 1 and 2, which may be used to guide the 
prediction of viable nucleotide substitutions. 

1. Polynucleotides Encoding the 3 1 Sequence of the GBV-V Genome 

A 3 ! sequence of the GBV-B genome disclosed in SEQ ID NO:l is one aspect of 
5 the present invention. Nucleic acids according to the present invention may encode the 3' 
sequence of the GBV-B genome set forth in SEQ ID NO:l, the entire GBV-B genome, or 
any other fragment of a 3' sequence of the GBV-B genome set forth herein. The nucleic 
acid may be derived from genomic RNA as cDNA, i.e., cloned~directly from the genome of 
GBV-B. cDNA may also be assembled from synthetic oligonucleotide segments. 

10 

It also is contemplated that a 3' sequence of the GBV-B genome may be 
represented by natural variants that have slightly different nucleic acid sequences but, 
nonetheless, maintain the same general structure (see FIGs. 1 and 2) and perform the same 
function in RNA replication. 

15 

As used in this application, the term "a nucleic acid encoding a 3 1 sequence of the 
GBV-B genome" refers to a nucleic acid molecule that may be isolated free of total viral 
nucleic acid. In preferred embodiments, the invention concerns nucleic acid sequences 
essentially as set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 

20 SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO:10, SEQ ID NO:ll, and SEQ ID NO:12. The term "as set forth in SEQ ID NO:l" 
means that the nucleic acid sequence substantially corresponds to a portion of SEQ ID 
NO:l. It is contemplated that the techniques and methods described in this disclosure may 
apply to any of the sequences contained herein, including SEQ ID NO:l, SEQ ID NO:2, 

25 SEQ ID NO:3 5 SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, and SEQ ID NO:12. 

Allowing for the degeneracy of the genetic code, sequences that have at least about 
50%, usually at least about 60%, more usually about 70%, most usually about 80%, 
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preferably at least about 90% and most preferably about 95% of nucleotides that are 
identical to the nucleotides of SEQ ID NO:l will be sequences that are "as set forth in SEQ 
ED NO:l." Sequences that are essentially the same as those set forth in SEQ ID NO: 1 may 
also be functionally defined as sequences that are capable of hybridizing to a nucleic acid 
5 segment containing the complement of SEQ ED NO: 1 under standard conditions. 

The nucleic acid segments and polynucleotides of the present invention include 
those encoding biologically functional equivalent 3' sequences of the GBV-B genome. 
Changes designed by man may be introduced through the application of site-directed 
10 mutagenesis techniques or may be introduced randomly and screened later for the desired 
function, as described below. 

3 1 sequence of the GBV-B genome sequences also are provided. Each of the 
foregoing is included within all aspects of the following description. The present 
15 invention concerns cDNA segments reverse transcribed from GBV-B genomic RNA 
(referred to as "DNA"). As used herein, the term "polynucleotide" refers to an RNA or 
DNA molecule that may be isolated free of other RNA or DNA of a particular species. 

"Isolated substantially away from other coding sequences" means that the 3* 
20 sequence of the GBV-B genome forms the significant part of the RNA or DNA segment 
and that the segment does not contain large portions of naturally-occurring coding RNA 
or DNA, such as large fragments or other functional genes or cDNA noncoding regions. 
Of course, this refers to the polynucleotide as originally isolated, and does not exclude 
genes or coding regions later added to the it by the hand of man. 

25 

In certain other embodiments, the invention concerns isolated DNA segments 
(cDNA segments reverse transcribed from GVB-B genomic RNA) and recombinant 
vectors that include within their sequence a nucleic acid sequence essentially as set forth 
in SEQ ID NO: 1 . The term "essentially as set forth in SEQ ID NO: 1" is used in the same 
30 sense as described above. 
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It also will be understood that nucleic acid sequences may include additional 
residues, such as additional 5 f or 3' sequences, and still be essentially as set forth in one of 
the sequences disclosed herein, so long as the sequence meets the criteria set forth above, 
including the maintenance of biological activity. The addition of terminal sequences 
particularly applies to nucleic acid sequences that may, for example, include additional 
various non-coding sequences flanking either of the 5' or 3' portions of the coding region, 
which are known to occur within viral genomes. 

Sequences that are essentially the same as those set forth in SEQ ID NO:l also 
may be functionally defined as sequences that are capable of hybridizing to a nucleic acid 
segment containing the complement of SEQ ID NO:l under relatively stringent 
conditions. Suitable relatively stringent hybridization conditions will be well known to 
those of skill in the art. 

The nucleic acid segments of the present invention, regardless of the length of the 
coding sequence itself, may be combined with other RNA or DNA sequences, such as 
promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning 
sites, other coding segments, and the like, such that their overall length may vary 
considerably. It is therefore contemplated that a nucleic acid fragment of almost any 
length may be employed, with the total length preferably being limited by the ease of 
preparation and use in the intended recombinant DNA protocol. 

For example, nucleic acid fragments may be prepared that include a short 
contiguous stretch identical to or complementary to SEQ ID NO:l, such as about 15-24 
or about 25-34 nucleotides and that are up to about 259 nucleotides being preferred in 
certain cases. Other stretches of contiguous sequence that may be identical or 
complementary to any of the sequences disclosed herein, including the SEQ ID NOS. 
include the following ranges of nucleotides: 50-9,399, 100-9,000, 150-8,000, 200-7,000, 
250-6,000, 300-5,000, 350-4,000, 400-3,000, 450-2,000, 500-1000. RNA and DNA 
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segments with total lengths of about 1,000, about 500, about 200, about 100 and about 50 
base pairs in length (including all intermediate lengths) are also contemplated to be 
useful. 

5 In a non-limiting example, one or more nucleic acid constructs may be prepared 

that include a contiguous stretch of nucleotides identical to or complementary to SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 5 SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID 
NO: 12. Such a stretch of nucleotides, or a nucleic acid construct, may be about, or at 
10 least about, 3, about 4, about 5, about 6, about 7, about 8„ about 9, about 10, about 11, 
about 12, about 13, about 14, about 15, about 16, about 17, about 18, about 19, about 20, 
q about 21, about 22, about 23, about 24, about 25, about 26, about 27, about 28, about 29, 

7% about 30, about 31, about 32, about 33, about 34, about 35, about 36, about 37, about 38, 

^ about 39 about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, 

01 15 about 80, about 85, about 90, about 95, about 100, about 105, about 110, about 115, about 

r.i 120, about 125, about 130, about 135, about 140, about 145, about 150, about 155, about 

L. 160, about 165, about 170, about 175, about 180, about 185, about 190, about 195, about 

CP 200, about 210, about 220, about 230, about 240, about 250, about 260, about 270, about 

m 280, about 290, about 300, about 310, about 320, about 330, about 340, about 350, about 

y 20 360, about 370, about 380, about 390, about 400, about 410, about 420, about 430, about 

fee* 

440, about 450, about 460, about 470, about 480, about 490, about 500, about 510, about 
520, about 530, about 540, about 550, about 560, about 570, about 580, about 590, about 
600, about 610, about 618, about 650, about 700, about 750, about 1,000, about 2,000, 
about 3,000, about 4,000, about 5,000, about 6,000, about 7,000, about 8,000, about 
25 9,000, about 9,100, about 9,200, about 9,300, about 9,399, about 9,400, about 9,500, 
about 9,600, about 9,700, about 9,800, about 9,900, about 10,000, about 15,000, about 
20,000, about 30,000, about 50,000, about 100,000, about 250,000, about 500,000, about 
750,000, to about 1,000,000 nucleotides in length, as well as constructs of greater size, up 
to and including chromosomal sizes (including all intermediate lengths and intermediate 
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ranges), given the advent of nucleic acids constructs such as a yeast artificial 
chromosome are known to those of ordinary skill in the art. 

It will be readily understood that "intermediate lengths," in these contexts means 
any length between the quoted ranges, such as 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 
50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all 
integers through the 200-500; 500-1,000; 1,000-2,000; ranges, up to and including 
sequences of about 1,001, 1,250, 1,500, and the like. 

The various probes and primers designed around the disclosed nucleotide sequences 
of the present invention may be of any length. By assigning numeric values to a sequence, 
for example, the first residue is 1, the second residue is 2, etc., an algorithm defining all 
primers can be proposed: 

n to n + y 

where n is an integer from 1 to the last number of the sequence and y is the length of the 
primer minus one, where n + y does not exceed the last number of the sequence. Thus, 
for a 20-mer, the probes correspond to bases 1 to 20, 2 to 21, 3 to 22 ... and so on. For a 
30-mer, the probes correspond to bases 1 to 30, 2 to 31, 3 to 32 ... and so on. For a 35- 
mer, the probes correspond to bases 1 to 35, 2 to 36, 3 to 37 ... and so on. 

2. Oligonucleotide Probes and Primers 

Naturally, the present invention also encompasses RNA and DNA segments that are 
complementary, or essentially complementary, to the sequence set forth in SEQ ID NO:l. 
Nucleic acid sequences that are "complementary" are those that are capable of base-pairing 
according to the standard Watson-Crick complementary rules. As used herein, the term 
"complementary sequences" means nucleic acid sequences that are substantially 
complementary, as may be assessed by the same nucleotide comparison set forth above, or 
as defined as being capable of hybridizing to the nucleic acid segment of SEQ ID NO. 1 
under relatively stringent conditions such as those described herein. Such sequences may 
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encode the entire 3* sequence of the GBV-B genome or functional or non-functional 
fragments thereof. 

Alternatively, the hybridizing segments may be shorter oligonucleotides. Sequences 
of 17 bases long should occur only once in the human genome and, therefore, suffice to 
specify a unique target sequence. Although shorter oligomers are easier to make and 
increase in vivo accessibility, numerous other factors are involved in determining the 
specificity of hybridization. Both binding affinity and sequence specificity of an 
oligonucleotide to its complementary target increases with increasing length. It is 
contemplated that exemplary oligonucleotides of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 225 or 
more base pairs will be used, although others are contemplated. Longer polynucleotides 
encoding 250, 500, 1000, 1212, 1500, 2000, 2500, 3000 or 3431 bases and longer are 
contemplated as well. Such oligonucleotides will find use, for example, as probes in 
Southern and Northern blots and as primers in amplification reactions. 

Suitable hybridization conditions will be well known to those of skill in the art. In 
certain applications, for example, substitution of amino acids by site-directed mutagenesis, 
it is appreciated that lower stringency conditions are required. Under these conditions, 
hybridization may occur even though the sequences of probe and target strand are not 
perfectly complementary but are mismatched at one or more positions. Conditions may be 
rendered less stringent by increasing salt concentration and decreasing temperature. For 
example, a medium stringency condition could be provided by about 0. 1 to 0.25 M NaCl at 
temperatures of about 37°C to about 55°C, while a low stringency condition could be 
provided by about 0.15 M to about 0.9 M salt, at temperatures ranging from about 20°C to 
about 55°C. Thus, hybridization conditions can be readily manipulated and thus will 
generally be a method of choice depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions of, for 
example, 50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 10 mM dithiothreitol, at 
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temperatures between approximately 20°C to about 37°C. Other hybridization conditions 
utilized could include approximately 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 mM 
MgCl 2 , at temperatures ranging from approximately 40°C to about 72°C. Formamide and 
SDS also may be used to alter the hybridization conditions. 

One method of using probes and primers of the present invention is in the search for 
other viral sequences related to GBV-B or, more particularly, homologs of the GBV-B 
sequence. By varying the stringency of hybridization, and the region of the probe, different 
degrees of homology may be discovered. 

Another way of exploiting probes and primers of the present invention is in site- 
directed, or site-specific, mutagenesis. The technique provides a ready ability to prepare 
and test sequence variants, incorporating one or more of the foregoing considerations, by 
introducing one or more nucleotide sequence changes into complementary DNA. Site- 
specific mutagenesis allows the production of mutants through the use of specific 
oligonucleotide sequences that encode the DNA sequence of the desired mutation, as well 
as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient 
size and sequence complexity to form a stable duplex on both sides of the deletion 
junction being traversed. Typically, a primer of about 17 to 25 nucleotides in length is 
preferred, with about 5 to 10 residues on both sides of the junction of the sequence being 
altered. 

The technique typically employs a bacteriophage vector that exists in both a 
single stranded and double stranded form. Typical vectors useful in site-directed 
mutagenesis include vectors such as the Ml 3 phage. These phage vectors are 
commercially available and their use is generally well known to those skilled in the art. 
Double stranded plasmids are also routinely employed in site directed mutagenesis, 
which eliminates the step of transferring the gene of interest from a phage to a plasmid. 
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In general, site-directed mutagenesis is performed by first obtaining a single- 
stranded vector, or melting of two strands of a double stranded vector which includes 
within its sequence a DNA sequence encoding the desired protein. An oligonucleotide 
primer bearing the desired mutated sequence is synthetically prepared. This primer is then 
5 annealed with the single-stranded DNA preparation, taking into account the degree of 
mismatch when selecting hybridization conditions, and subjected to DNA polymerizing 
enzymes such as E. coli polymerase I Klenow fragment, in order to complete the 
synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one 
strand encodes the original non-mutated sequence and the second strand bears the desired 
mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. 
coli cells, and clones are selected that include recombinant vectors bearing the mutated 
sequence arrangement. There are newer and simpler site-directed mutagenesis techniques 
that can also be employed for this purpose. These include procedures marketed in kit 
form that are readily available to one of ordinary skill in the art. 

The preparation of sequence variants of the selected gene using site-directed 
mutagenesis is provided as a means of producing potentially useful species and is not 
meant to be limiting, as there are other ways in which sequence variants of genes may be 
obtained. For example, recombinant vectors encoding the desired gene may be treated 
with mutagenic agents, such as hydroxylamine, to obtain sequence variants. 

3. Antisense Constructs 

In certain embodiments of the invention, the use of antisense constructs of the 3' 
sequence of the GBV-B genome is contemplated. 

Antisense methodology takes advantage of the fact that nucleic acids tend to pair 
with "complementary" sequences. By complementary, it is meant that polynucleotides 
are those which are capable of base-pairing according to the standard Watson-Crick 
complementarity rules. That is, the larger purines will base pair with the smaller 
pyrimidines to form combinations of guanine paired with cytosine (G:C) and adenine 
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paired with either thymine (A:T) in the case of DNA, or adenine paired with uracil (A:U) 
in the case of RNA. Inclusion of less common bases such as inosine, 5-methylcytosine, 
6-methyladenine, hypoxanthine and others in hybridizing sequences does not interfere 
with pairing. 

Targeting double-stranded (ds) DNA with polynucleotides leads to triple-helix 
formation; targeting RNA will lead to double-helix formation. Antisense 
polynucleotides, when introduced into a target cell, specifically bind to their target 
polynucleotide and interfere with transcription, RNA processing, transport, translation 
and/or stability. Antisense RNA constructs, or DNA encoding such antisense RNAs, may 
be employed to inhibit gene transcription or translation or both within a host cell, either 
in vitro or in vivo, such as within a host animal, including a human subject. 

Antisense constructs could be used to block early steps in the replication of GBV- 
B and related viruses, by annealing to 3' terminal sequences and blocking their role in 
negative-strand initiation. 

As stated above, "complementary" or "antisense" means polynucleotide 
sequences that are substantially complementary over their entire length and have very few 
base mismatches. For example, sequences of 15 bases in length may be termed 
complementary when they have complementary nucleotides at 13 or 14 positions. 
Naturally, sequences which are completely complementary will be sequences which are 
entirely complementary throughout their entire length and have no base mismatches. 
Other sequences with lower degrees of homology also are contemplated. For example, an 
antisense construct which has limited regions of high homology but also contains a non- 
homologous region (e.g., ribozyme; see below) could be designed. These molecules, 
though having less than 50% homology, would bind to target sequences under 
appropriate conditions. 
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4. Amplification and PCR™ 

The present invention utilizes amplification techniques in a number of its 
embodiments. Nucleic acids used as a template for amplification are isolated from cells 
contained in the biological sample, according to standard methodologies (Sambrook 
et al, 1989). The nucleic acid may be genomic DNA or RNA or fractionated or whole 
cell RNA. Where RNA is used, it may be desired to convert the RNA to a 
complementary DNA using reverse transcriptase (RT). In one embodiment, the RNA is 
genomic RNA and is used directly as the template for amplification. In others, genomic 
RNA is first converted to a complementary DNA sequence (cDNA) and this product is 
amplified according to protocols described below. 

Pairs of primers that selectively hybridize to nucleic acids corresponding to GBV- 
B sequences are contacted with the isolated nucleic acid under conditions that permit 
selective hybridization. The term "primer," as defined herein, is meant to encompass any 
nucleic acid that is capable of priming the synthesis of a nascent nucleic acid in a 
template-dependent process. Typically, primers are oligonucleotides from ten to twenty 
base pairs in length, but longer sequences can be employed. Primers may be provided in 
double-stranded or single-stranded form, although the single-stranded form is preferred. 

Once hybridized, the nucleic acid:primer complex is contacted with one or more 
enzymes that facilitate template-dependent nucleic acid synthesis. Multiple rounds of 
amplification, also referred to as "cycles," are conducted until a sufficient amount of 
amplification product is produced. 

Next, the amplification product is detected. In certain applications, the detection 
may be performed by visual means. Alternatively, the detection may involve indirect 
identification of the product via chemiluminescence, radioactive scintigraphy of 
incorporated radiolabel or fluorescent label or even via a system using electrical or 
thermal impulse signals. 
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A number of template dependent processes are available to amplify the marker 
sequences present in a given template sample. One of the best known amplification 
methods is the polymerase chain reaction (referred to as PCR™) which is described in 
detail in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159, and each incorporated 
herein by reference in entirety. 

Briefly, in PCR™, two or more primer sequences are prepared that are 
complementarj^ to J^egions on opposite complementary strands of the marker sequence. 
An excess of deoxynucleoside triphosphates are added to a reaction mixture along with a 
DNA polymerase, e.g., Taq polymerase. If the marker sequence is present in a sample, 
the primers will bind to the marker and the polymerase will cause the primers to be 
extended along the marker sequence by adding on nucleotides. By raising and lowering 
the temperature of the reaction mixture, the extended primers will dissociate from the 
marker to form reaction products, excess primers will bind to the marker and to the 
reaction products and the process is repeated. 

A reverse transcriptase PCR™ amplification procedure may be performed in order 
to quantify the amount of mRNA amplified. Methods of reverse transcribing RNA into 
cDNA are well known and described in Sambrook et aL, 1989. Alternative methods for 
reverse transcription utilize thermostable, RNA-dependent DNA polymerases. These 
methods are described in WO 90/07641, filed December 21, 1990, incorporated herein by 
reference. Polymerase chain reaction methodologies are well known in the art. 

Another method for amplification is the ligase chain reaction ("LCR"), disclosed 
in EPA No. 320 308, incorporated herein by reference in its entirety. U.S. Patent 
4,883,750 describes a method similar to LCR for binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCT Application No. PCTYUS87/00880, 
incorporated herein by reference, also may be used as still another amplification method 
in the present invention, 
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An isothermal amplification method, in which restriction endonucleases and 
ligases are used to achieve the amplification of target molecules that contain nucleotide 
S'-falpha-thioJ-triphosphates in one strand of a restriction site also may be useful in the 
amplification of nucleic acids in the present invention. 

Strand Displacement Amplification (SDA) is another method of carrying out 
isothermal amplification of nucleic acids which involves multiple rounds of strand 
displacement and synthesis, i.e., nick translation. A similar method, called Repair Chain 
Reaction (RCR), involves annealing several probes throughout a region targeted for 
amplification, followed by a repair reaction in which only two of the four bases are 
present. The other two bases can be added as biotinylated derivatives for easy detection. 
A similar approach is used in SDA. Target specific sequences also can be detected using 
a cyclic probe reaction (CPR). In CPR, a probe having 3* and 5' sequences of 
non-specific DNA and a middle sequence of specific RNA is hybridized to DNA that is 
present in a sample. Upon hybridization, the reaction is treated with RNase H, and the 
products of the probe identified as distinctive products that are released after digestion. 
The original template is annealed to another cycling probe and the reaction is repeated. 

Still another amplification methods described in GB Application No. 2 202 328, 
and in PCT Application No. PCT/US89/01025, each of which is incorporated herein by 
reference in its entirety, may be used in accordance with the present invention. In the 
former application, "modified" primers are used in a PCR-like, template- and 
enzyme-dependent synthesis. The primers may be modified by labeling with a capture 
moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an 
excess of labeled probes are added to a sample. In the presence of the target sequence, 
the probe binds and is cleaved catalytically. After cleavage, the target sequence is 
released intact to be bound by excess probe. Cleavage of the labeled probe signals the 
presence of the target sequence. 
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Other nucleic acid amplification procedures include transcription-based 
amplification systems (TAS), including nucleic acid sequence based amplification 
(NASBA) and 3SR Gingeras etal, PCT Application WO 88/10315, incorporated herein 
by reference. 

5 

Davey et al, EPA No. 329 822 (incorporated herein by reference in its entirety) 
disclose a nucleic acid amplification process involving cyclically synthesizing 
single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which 
may be used in accordance with the present invention. 

10 

Miller et al. 9 PCT Application WO 89/06700 (incorporated herein by reference in 
its entirety) disclose a nucleic acid sequence amplification scheme based on the 
hybridization of a promoter/primer sequence to a target ssDNA followed by transcription 
of many RNA copies of the sequence. This scheme is not cyclic, z.e., new templates are 
15 not produced from the resultant RNA transcripts. Other amplification methods include 
"RACE" and "one-sided PCR" (Frohman, 1990 incorporated by reference). 

Methods based on ligation of two (or more) oligonucleotides in the presence of 
nucleic acid having the sequence of the resulting "di-oligonucleotide", thereby amplifying 
20 the di-oligonucleotide, also may be used in the amplification step of the present 
invention. 

Following any amplification, it may be desirable to separate the amplification 
product from the template and the excess primer for the purpose of determining whether 
25 specific amplification has occurred. In one embodiment, amplification products are 
separated by agarose, agarose-acrylamide or poly aery lamide gel electrophoresis using 
standard methods. See Sambrook et aL, 1989. 

Alternatively, chromatographic techniques may be employed to effect separation. 
30 There are many kinds of chromatography that may be used in the present invention: 
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adsorption, partition, ion-exchange and molecular sieve, and many specialized techniques 
for using them including column, paper, thin-layer and gas chromatography. 

Amplification products must be visualized in order to confirm amplification of the 
marker sequences. One typical visualization method involves staining of a gel with 
ethidium bromide and visualization under UV light. Alternatively, if the amplification 
products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the 
amplification products can then be exposed to x-ray film or visualized under the 
appropriate stimulating spectra, following separation. 

In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled nucleic acid probe is brought into contact with the 
amplified marker sequence. The probe preferably is conjugated to a chromophore but 
may be radiolabeled. In another embodiment, the probe is conjugated to a binding 
partner, such as an antibody or biotin, and the other member of the binding pair carries a 
detectable moiety. 

In one embodiment, detection is by Southern blotting and hybridization with a 
labeled probe. The techniques involved in Southern blotting are well known to those of 
skill in the art and can be found in many standard books on molecular protocols. See 
Sambrook etaL, 1989. Briefly, amplification products are separated by gel 
electrophoresis. The gel is then contacted with a membrane, such as nitrocellulose or 
nylon, permitting transfer of the nucleic acid and non-covalent binding. Subsequently, 
the membrane is incubated with a chromophore-conjugated probe that is capable of 
hybridizing with a target amplification product. Detection is by exposure of the 
membrane to x-ray film or ion-emitting detection devices. 

One example of the foregoing is described in U.S. Patent 5,279,721, incorporated 
by reference herein, which discloses an apparatus and method for the automated 
electrophoresis and transfer of nucleic acids. The apparatus permits electrophoresis and 
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blotting without external manipulation of the gel and is ideally suited to carrying out 
methods according to the present invention. 

5. Expression Constructs 

In some embodiments of the present invention, an expression construct that 
encodes a 3' sequence of GBV-B is utilized. The term "expression construct" is meant to 
include any type of genetic construct containing a nucleic acid coding for a gene product 
in which part or all of the nucleic acid encoding sequence is capable of being transcribed. 
The transcript may be translated into a protein, but it need not be. Expression includes 
both transcription of a gene and translation of mRNA into a gene product. Expression 
may also include only transcription of the nucleic acid encoding a gene of interest. 

In some constructs, the nucleic acid encoding a gene product is under 
transcriptional control of promoter and/or enhancer. The term promoter will be used here 
to refer to a group of transcriptional control modules that are clustered around the 
initiation site for RNA polymerase II. Much of the thinking about how promoters are 
organized derives from analyses of several viral promoters, including those for the HSV 
thymidine kinase (tk) and SV40 early transcription units. These studies have shown that 
promoters are composed of discrete functional modules, each consisting of approximately 
7-20 bp of nucleic acids, and containing one or more recognition sites for transcriptional 
activator or repressor proteins. 

At least one module in each promoter functions to position the start site for RNA 
synthesis. The best known example of this is the TATA box, but in some promoters 
lacking a TATA box, such as the promoter for the mammalian terminal deoxynucleotidyl 
transferase gene and the promoter for the SV40 late genes, a discrete element overlying 
the start site itself helps to fix the place of initiation. 

Additional promoter elements regulate the frequency of transcriptional initiation. 
Typically, these are located in the region 30-110 bp upstream of the start site, although a 
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number of promoters have recently been shown to contain functional elements 
downstream of the start site as well. The spacing between promoter elements frequently 
is flexible, so that promoter function is preserved when elements are inverted or moved 
relative to one another. In the tk promoter, the spacing between promoter elements can 
be increased to 50 bp apart before activity begins to decline. Depending on the promoter, 
it appears that individual elements can function either co-operatively or independently to 
activate transcription. 

Enhancers were originally detected as genetic elements that increased 
transcription from a promoter located at a distant position on the same molecule of DNA. 
This ability to act over a large distance had little precedent in classic studies of 
prokaryotic transcriptional regulation. Subsequent work showed that regions of nucleic 
acids with enhancer activity are organized much like promoters. That is, they are 
composed of many individual elements, each of which binds to one or more 
transcriptional proteins. 

The basic distinction between enhancers and promoters is operational. An 
enhancer region as a whole must be able to stimulate transcription at a distance; this need 
not be true of a promoter region or its component elements. On the other hand, a 
promoter must have one or more elements that direct initiation of RNA synthesis at a 
particular site and in a particular orientation, whereas enhancers lack these specificities. 
Promoters and enhancers are often overlapping and contiguous, often seeming to have a 
very similar modular organization. 

6. Host Cells and Permissive Cells 

As used herein, the terms "cell," "cell line," and "cell culture" may be used 
interchangeably. All of these terms also include their progeny, which is any and all 
subsequent generations. It is understood that all progeny may not be identical due to 
deliberate or inadvertent mutations. In the context of the present invention, "host cell" 
refers to a prokaryotic or eukaryotic cell, and it includes any transformable organisms that 
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is capable of replicating a vector or virus and/or expressing viral proteins. A host cell 
can, and has been, used as a recipient for vectors, including viral vectors. A host cell may 
be "transfected" or "transformed," which refers to a process by which exogenous nucleic 
acid is transferred or introduced into the host cell. A transformed cell includes the 
primary subject cell and its progeny. A "permissive cell" refers to a cell that supports the 
replication of a given virus and consequently undergoes cell lysis. In the context of the 
present invention, such a virus would include HCV, GBV-B, or other hepatitis viruses. 
In a "nonpermissive cell," productive infection does not result, but the cell may become 
stably transformed. In some embodiments, methods employ permissive cells that are a 
cell line derived from liver cells (liver cell line). 

Host cells may be derived from prokaryotes or eukaryotes, depending upon 
whether the desired result is replication of the vector or expression of part or all of the 
vector-encoded nucleic acid sequences. Numerous cell lines and cultures are available 
for use as a host cell, and they can be obtained through the American Type Culture 
Collection (ATCC), which is an organization that serves as an archive for living cultures 
and genetic materials (www.atcc.org). An appropriate host can be determined by one of 
skill in the art based on the vector backbone and the desired result. A plasmid or cosmid, 
for example, can be introduced into a prokaryote host cell for replication of many vectors. 
Bacterial cells used as host cells for vector replication and/or expression include DH5a, 
JM109, and KC8, as well as a number of commercially available bacterial hosts such as 
SURE® Competent Cells and Solopack™ Gold Cells (Stratagene®, La Jolla). 
Alternatively, bacterial cells such as E. coli LE392 could be used as host cells for phage 
viruses. 

Examples of eukaryotic host cells for replication and/or expression of a vector 
include HeLa, NIH3T3, Jurkat, 293, Cos, CHO, Saos, and PC12. Many host cells from 
various cell types and organisms are available and would be known to one of skill in the 
art. Similarly, a viral vector or virus or virus particle may be used in conjunction with 
either a eukaryotic or prokaryotic host cell, particularly one that is permissive for 
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replication or expression of the vector. It is contemplated that the present invention 
includes vectors composed of viral sequences, viruses, and viral particles in the methods 
of the present invention, and that they may be used interchangeably in these methods, 
depending on their utility. 

Some vectors may employ control sequences that allow it to be replicated and/or 
expressed in both prokaryotic and eukaryotic cells. One of skill in the art would further 
understand the conditions under which to incubate all of the above described host cells to 
maintain them and to permit replication of a vector. Also understood and known are 
techniques and conditions that would allow large-scale production of vectors, as well as 
production of the nucleic acids encoded by vectors and their cognate polypeptides, 
proteins, or peptides. 

7. Pharmaceutical Compositions 

The present invention encompasses the use of a 3' sequence of GBV-B in the 
production of or use as a vaccine to combat HCV infection. Compositions of the present 
invention comprise an effective amount of GBV-B clone as a therapeutic dissolved or 
dispersed in a pharmaceutically acceptable carrier or aqueous medium. The phrases 
"pharmaceutically or pharmacologically acceptable" refer to molecular entities and 
compositions that do not produce an adverse, allergic or other untoward reaction when 
administered to an animal, or a human, as appropriate. 

As used herein, "pharmaceutically acceptable carrier 11 includes any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents and the like. The use of such media and agents for 
pharmaceutical active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active ingredient, its use in the 
therapeutic compositions is contemplated. Supplementary active ingredients can also be 
incorporated into the compositions. For human administration, preparations should meet 
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sterility, pyrogenicity, general safety and purity standards as required by FDA Office of 
Biologies standards. 

The biological material should be extensively dialyzed to remove undesired small 
5 molecular weight molecules and/or lyophilized for more ready formulation into a desired 
vehicle, where appropriate. The active compounds will then generally be formulated for 
parenteral administration, e.g., formulated for injection via the intravenous, 
intramuscular, sub-cutaneous, intralesional, or even intraperitoneal routes. The 
preparation of an aqueous composition that contains GVB-B nucleic acid sequences as an 
10 active component or ingredient will be known to those of skill in the art in light of the 
present disclosure. Typically, such compositions can be prepared as injectables, either as 
liquid solutions or suspensions; solid forms suitable for using to prepare solutions or 
suspensions upon the addition of a liquid prior to injection can also be prepared; and the 
preparations can also be emulsified. 

15 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions; formulations including sesame oil, peanut oil or aqueous 
propylene glycol; and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersions. In all cases the form must be sterile and must be fluid 
20 to the extent that easy syringability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms, such as bacteria and fungi. 

Solutions of the active compounds as free base or pharmacologically acceptable 
25 salts can be prepared in water suitably mixed with a surfactant, such as 
hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid 
polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 
microorganisms. 

30 
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A GBV-B clone of the present invention can be formulated into a composition in 
a neutral or salt form. Pharmaceutically acceptable salts include the acid addition salts 
(formed with the free amino groups of the protein) and are formed with inorganic acids 
such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, 
5 oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can 
also be derived from inorganic bases such as, for example, sodium, potassium, 
ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. In terms of using peptide therapeutics as 
active ingredients, the technology of U.S. Patents 4,608,251; 4,601,903; 4,599,231; 
10 4,599,230; 4,596,792; and 4,578,770, each incorporated herein by reference, may be 
used. 

The carrier also can be a solvent or dispersion medium containing, for example, 
water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene 

15 glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity 
can be maintained, for example, by the use of a coating, such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. The prevention of the action of microorganisms can be brought about by 
various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 

20 sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the 
injectable compositions can be brought about by the use in the compositions of agents 
delaying absorption, for example, aluminum monostearate and gelatin. 

25 Sterile injectable solutions are prepared by incorporating the active compounds in 

the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredients into a sterile 
vehicle which contains the basic dispersion medium and the required other ingredients 

30 from those enumerated above. In the case of sterile powders for the preparation of sterile 
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injectable solutions, the preferred methods of preparation are vacuum-drying and freeze- 
drying techniques which yield a powder of the active ingredient plus any additional 
desired ingredient from a previously sterile-filtered solution thereof. The preparation of 
more, or highly, concentrated solutions for direct injection is also contemplated, where 
the use of DMSO as solvent is envisioned to result in extremely rapid penetration, 
delivering high concentrations of the active agents to a small area. 

Upon formulation, solutions will be administered in a manner compatible with the 
dosage formulation and in such amount as is therapeutically effective. The formulations 
are easily administered in a variety of dosage forms, such as the type of injectable 
solutions described above, but drug release capsules and the like also can be employed. 

For parenteral administration in an aqueous solution, for example, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 
connection, sterile aqueous media which can be employed will be known to those of skill 
in the art in light of the present disclosure. For example, one dosage could be dissolved 
in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or 
injected at the proposed site of infusion, (see for example, "Remington's Pharmaceutical 
Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will 
necessarily occur depending on the condition of the subject being treated. The person 
responsible for administration will, in any event, determine the appropriate dose for the 
individual subject. 

In addition to the compounds formulated for parenteral administration, such as 
intravenous or intramuscular injection, other pharmaceutically acceptable forms include, 
e.g., tablets or other solids for oral administration; liposomal formulations; time release 
capsules; and any other form currently used, including cremes. 
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Oral formulations include such normally employed excipients as, for example, 
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate and the like. These compositions take the 
form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or 
powders. In certain defined embodiments, oral pharmaceutical compositions will 
comprise an inert diluent or assimilable edible carrier, or they may be enclosed in hard or 
soft shell gelatin capsule, or they may be compressed into tablets, or they may be 
incorporated directly with the food of the diet. For oral therapeutic administration, the 
active compounds may be incorporated with excipients and used in the form of ingestible 
tablets, buccal tablets, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. 
Such compositions and preparations should contain at least 0.1% of active compound. 
The percentage of the compositions and preparations may, of course, be varied and may 
conveniently be between about 2 to about 75% of the weight of the unit, or preferably 
between 25-60%. The amount of active compounds in such therapeutically useful 
compositions is such that a suitable dosage will be obtained. 

The tablets, troches, pills, capsules and the like may also contain the following: a 
binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium 
phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the 
like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, 
lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of 
wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, 
in addition to materials of the above type, a liquid carrier. Various other materials may 
be present as coatings or to otherwise modify the physical form of the dosage unit. For 
instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of 
elixir may contain the active compounds sucrose as a sweetening agent methyl and 
propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. 
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C. Examples 

The following examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventor 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

EXAMPLE 1 
Elucidation of a 3 ? Sequence of the GBV-B Genome 

The inventors elucidated the previously unrecognized 3 ! terminal sequence of 
GBV-B (SEQ ID NO 1). This sequence was reproducibly recovered from tamarin serum 
containing GBV-B RNA, in RT-PCR nucleic acid amplification procedures using several 
different primer sets, and as a fusion with previously reported 5' GBV-B sequences. 

There is information in the published literature reporting the putative sequences of 
the 5' and 3' termini of the GBV-B genome. The nucleic acid sequences of these termini 
were reportedly determined by ligating the ends of the viral RNA together, amplifying the 
sequence in the region of the resulting junction by reverse-transcription polymerase chain 
reaction (RT-PCR), and sequencing of the cDNA amplification product across the 
junction. However, the inventors believed these results required confirmation. Of 
particular concern was the fact that the 3' terminus appeared to be shorter than the 
equivalent region of other viruses in the family Flaviviridae (especially within the genus 
Hepacivirus) and that the reported 3' sequence lacked a defined RNA hairpin structure 
such as those present in these related viruses. Additional novel sequences at the 3' end of 
the GBV-B genome were investigated using a serum sample collected from a tamarin that 
was experimentally infected with virus. Amplification was used to determine the 
sequence of the 3' end. 
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First, serum (50 jliL) known to contain GBV-B RNA by RT-PCR assay was 
extracted with Trizol, and the RNA was washed and dried. A synthetic oligonucleotide 
was then ligated to the 3' end of the viral RNA. The oligonucleotide, 
5 AATTCGGCCCTGCAGGCCACAACAGTC, which was phosphorylated at the 5 X end 
and chemically blocked at the 3' end, was ligated to the RNA essentially using the 
method described by Kolykhalov et al. (Behrens et ah, 1996). The RNA was initially 
_<UsrolyedJn DMSO and tfie following ;_additiqns L wer^made:_Tris-Cl, pH 7.5 (10 mM),_ 
MgCl 2 (10 mM), DTT (5 mM), hexamine cobalt chloride (1 mM), 10 pmol oligo and 8 U 

10 T4 ligase. The final concentration of DMSO was 30% in a final volume of 10 )iL. The 
ligation reaction was incubated for 4 or 20 hours at 19° C. 1 of the ligation reaction 
was used directly to make cDNA, using a primer complementary to the ligated 
oligonucleotide and the Superscript 2 system, in a final volume of 15 (iL. 1 jaL of cDNA 
was amplified using the Advantage cDNA system (Clontech) and two additional 

15 oligonucleotide primers. These primers included one that was complementary to the 
ligated oligonucleotide {i.e., "negative sense") and a positive-sense primer located near 
the 3' end of the reported GBV-B sequence. A product approximately 290 bases in 
length was obtained, and this was gel purified and directly sequenced. Sequencing was 
done in both directions using the oligonucleotide primers employed for the amplification; 

20 259 bases that had not been previously reported were identified as fused to the sequence 
that had been previously described as the 3" terminus of the viral genome. 

To ensure that this novel 3' sequence from viral RNA could be reproducibly 
amplified, an additional 10 \iL of infected tamarin serum was extracted using Trizol. 
25 cDNA was prepared by reverse transcription using an oligonucleotide primer 
complementary to the penultimate 3' 25 bases of the novel sequence. Amplification was 
then done by PCR using the primer previously utilized for cDNA synthesis and a 
positive-sense primer mapping within the previously published GBV-B sequence. In the 
initial experiments, although a product was readily detected, DNA sequencing showed 
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that this product was missing all of the sequence distal to the poly-U tract. Carrying out 
the cDNA synthesis in the presence of DMSO circumvented this problem. A cDNA 
product of approximately 290 bases was obtained. This was sequenced and shown to 
consist of the 5' primer, 20 bases of the published GBV-B sequence, and 259 bases of the 
5 novel sequence obtained in the preceding experiments and containing the sequence of the 
3^ primer. The sequence of the 3^ end of GBV-B is shown in SEQ ID NO: 1 (FIG. 1). 
The possible secondary RNA structure for this region is shown in FIG. 2, as predicted by 
a computer-based RNA folding program. The presence of a predicted hairpin structure at 
the extreme 3' end of this novel sequence is consistent with its location at the 3' terminus 
10 of the viral RNA. 

The GBV-B cDNA (synthesis described above) was used as a template for PCR 
amplification of the 3' 1553 nucleotides (nts) of the GBV-B genome. This PCR 
amplification product was gel purified and cloned into plasmid DNA using the "Perfectly 
Blunt Cloning Kit" (Novagene). 

EXAMPLE 2 
Construction of an Infectious GBV-B Clone 

The elucidation of a 3 1 sequence of the GBV-B genome will allow those of skill in 
the art to construct and validate an infectious molecular clone of GBV-B. This will be 
done using the following procedures. 

A full-length cDNA copy of the GBV-B genome containing the newly identified 
3' terminal sequences was constructed. RNA transcribed from this cDNA copy of the 
25 genome will be infectious when inoculated into the liver of a GBV-B permissive tamarin, 
giving rise to rescued GBV-B virus particles. 

A 1:1000 dilution of GBV-B infectious tamarin serum was obtained. This 
material was used as a source of viral RNA for the amplification of GBV-B nucleic acid 
30 sequences by reverse-transcription polymerase chain reaction. For amplification of 
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previously reported segments of the GBV-B genome, 250 |iL of the diluted serum was 
extracted with Trizol using the manufacturer's instructions. The final RNA pellet was 
dissolved in 10 (iL of a 100 mM DTT buffer containing 5% RNasin. This material was 
converted into cDNA using Superscript 2 reverse transcriptase and oligonucleotide 
primers designed to be complementary to the reported GBV-B RNA sequence and to 
contain unique restriction sites. This cDNA was amplified using the Advantage cDNA 
kit (Clontech) employing the cDNA primer (negative sensiv as the downstream primer 
. and_a. similar, positive.-sense. upstream^primer, .again .containing a unique_restriction site._ 
The published sequence of GBV-B allowed for the selection of primers in convenient 
areas of the genome containing unique restriction sites. Using this general strategy, the 
inventors amplified segments of the reported GBV-B genome representing: (1) 
nucleotides (nts) 1-1988, using an upstream primer containing a T7 RNA polymerase 
promoter and a BamHl site upstream of nt 1, and a downstream primer containing a 
unique EcolRl site (nt 1978); (2) nts 1968-5337, using a downstream primer containing a 
unique Clal site at position 5o27; (3) nts 5317- 7837, using a downstream primer 
containing a Sail site at nt 7847; and, (4) nts 7837-9143, using a downstream primer 
containing an added Xhol site. It was found necessary to use different PCR conditions 
for each primer set. 

The RT-PCR products generated in these reactions were cloned into plasmid 
DNA after gel purification, using the "Perfectly Blunt Cloning Kit" (Novagene). Ten 
bacterial colonies from each of the four RT-PCR products were analyzed for insert size 
by restriction endonuclease digestion using EcoRl, the sites for this enzyme being 
located on either side of the insert in the resulting plasmids. For three of the RT-PCR 
amplicons, 9 of 10 colonies contained plasmids with the correct size insert. The EcoRl- 
Clal amplicon generated only 1/10 colonies with a correct size insert. Thus, 30 
additional colonies were examined, yielding two more clones with insert of the correct 
size. For each of these plasmids, simple restriction patterns were obtained using two 
restriction enzymes. As these appeared to be correct, the plasmid DNAs were subjected 
to sequencing using an ABI automatic sequencer. 
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EXAMPLE 3 
Nucleotide sequence of the cloned GBV-B cDNA 

The 5' region of the cloned sequence revealed a relatively long nontranslated 
region corresponding to the published sequence of the GBV-B 5'NTR, which includes an 
IRES. This region was followed by a long open reading frame. Near the 3' end of the 
genome a poly-U tract was identified; however, this was shorter than the published 3' 
homopolymeric poly-U region. The sequence from these clones was compared with those 
in the GenBank database (Accession U22304, "Hepatitis GB virus B polypeptide 
complete genome"). Twenty-two nucleotide differences were identified, of which 14 
gave rise to amino acid changes (Table 1). In order to determine whether these changes 
were genuine or RT-PCR artifacts, which could have been introduced due to the very 
small amount of material from which these sequences were amplified, segments of the 
genome containing these changes were reamplified using a serum sample from an 
independently infected tamarin. Of the 14 changes noted in the original cDNA clones, 12 
were not present in these newly amplified sequences and thus were probably RT-PCR 
artifacts (Table 2.). A particularly interesting difference from the published GenBank 
sequence, however, which was present in both the original clones as well as a repeat 
amplification, was a two-nucleotide substitution that obliterated the Sail site present in 
the published sequence. 



Table 1. Differences in the amino acid sequences of GBV-B cDNA clones 
and the GBV-B sequence reported by Simons et al (1995). 





AMINO ACID A 


RT-PCR Products 


GBV-B 


FROM 


from Tamarin 


Protein 


ABBOTT 
SEQUENCE 


12024 




(PCR Reaction #) 


Core 


G 99 — >S 


G (38.1a) 


El 


V 395 ->I 


V (40.2a) 


E2 


D 703 ->N 


D (42.3a) 


E2 


P 7 06->Q 


H (42.3a) 



256261.1 



-36- 



E2 




A (42.3a) 


NS2 




F(42.3a) 


NS2 




A(42 3a) 


NS5A 




L (46 5a) 

-1—/ 1 T^Vn/UI 


NS5A 


A 2082^ A 


I (46.5a) 


NS5A 


^2174 


(not done) 


NS5A 


^2228" 


E (48.6a) 


NS5A 


T2233 - 


T (48.6a) 


NS5A 


A 2236 ->V 


A (48.6a) 


NS5B 


v 2833 ->i 


V (50.7a) 



EXAMPLE 4 
Construction of a full-length GBV-B cDNA clone. 

5 The four GBV-B cDNA inserts described above were cloned into Bluescript ks+ 

using unique restriction sites. Since the unique Sail site that was reported to be present in 
the published GBV-B sequence (nt position 7847) was absent in these cDNA clones, this 
In restriction site was created by engineering two silent nucleotide changes using the "Quick 

^ Change" mutagenesis system (Stratagene). Although the most 5' clones (nucleotides 1- 

y 10 7847) could be readily constructed, attempts to add the remaining 3' clones were 

rji 

O unsuccessful due to rearrangements and deletions. This problem was overcome by use of 

q pACNR1180, a plasmid that had been used to construct an infectious clone of yellow 

w fever virus. Finally, the most 3' 771 nucleotides of GBV-B were excised from the 

plasmid containing the novel, previously unreported 3' sequence, and inserted into the 
15 truncated assembled GBV-B cDNA construct to complete the 3' end. The 3' terminus of 
this full-length cDNA was then subjected to DNA sequencing to confirm its integrity. 
Extensive restriction digests indicated that this construct had the characteristics of a full- 
lengtfrc^DN^ copy of GBV-B virus. Because there is not yet an understanding of which 
culturedreells (if any) might be permissive for GBV-B replication, the infectivity of the 
20 synthetic GBV-B RNA will be assessed by injecting the RNA directly into the liver of a 
susceptible tamarin. 



256261.1 



-37- 



Alternatively, an infectious full-length clone can be produced by the following 
protocol. A plasmid will be made containing a cassette including the 5' and 3" ends of 
the virus flanked by appropriate restriction sites. These constructs have been shown to 
efficiently translate reporter genes, with transcription taking place via a T7 promoter 
placed immediately upstream of the 5'NTR (e.g., see Rijnbrand et al, 1999). The major 
portion of the GBV-B genome would then be amplified by long range RT-PCR. This 
method is now well established for hepatitis C virus and other flavi viruses (Teller et al, 
1996), and it has been used successfully also to amplify rhinovirus RNA. Briefly this 
technique uses "Superscript" reverse transcriptase to synthesize cDNA and a mixture of 
"KlenTaq 1", and "Deep Vent" polymerases to amplify this cDNA . Primers that can be 
used will contain restriction sites to allow cloning of the RT-PCR products into the 
cassette vector. After being transformed into suitable competent bacteria, extensive 
restriction analysis will enable us to determine which clones contain inserts that are of 
full length and which have a high probability of being correct. Apparent full-length 
clones will be analyzed further by coupled transcription-translation using the Promega 
"TnT" system, with the addition of microsomal membranes to allow the cleavage of the 
structural proteins by cellular signalase enzymes. Clones which appear correct by 
restriction analysis and which produce GBV-B proteins, in particular the protein coded 
for by the extreme 3' end of the genome, NS5B, will be selected, and RNA will be 
transcribed from these clones using the Ambion MegaScript system. "Correct" looking 
clones (>10) can be injected directly into a tamarin liver at several sites. A successful 
infection will be determined as described below. If a positive signal is detected the entire 
genome will be amplified and sequenced to determine which plasmid the virus originated 
from. 

EXAMPLE 5 
Rescue of Infectious GVB-B 

Infectious GBV-B will be rescued from synthetic genome-length RNA following 
its injection into the liver of tamarins (Saquinus sp.). In past studies, HAV from synthetic 
RNA in owl monkeys has been recovered (Aotus trivirgatus) (Shaffer et al, 1995), and 
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more recently, the recovery of virus from a chimpanzee injected intrahepatically with 
RNA transcribed from a full-length genotype lb HCV cDNA clone was reported (Beard 
etaL, 1999). 

RNA will be prepared for these experiments using the T7 MegaScript kit 
(Ambion) and a total of 10 jag of plasmid DNA as template. An aliquot of the reaction 
products will be utilized to ensure the integrity of the RNA by electrophoresis in agarose- 
Jbnnaldehyde gels. The remainder of the transcription reaction mix will be frozen at 
-80C° until its injection, without further purification, into the liver of a tamarin. Because 
of the small size of the tamarin, the RNA will be injected under direct visualization 
following a limited incision and exposure of the liver. Under similar conditions, in other 
primate species, RT-PCR-detectable viral RNA or cDNA has not been detected in serum 
samples collected within days of this procedure in the absence of viral replication 
(Kolykhalov et al, 1997; Yanagi et aL 9 1998; Beard et al, 1999). Thus, the appearance 
of RNA in serum collected subsequently from these tamarins will be strong evidence for 
the replication competence of the synthetic RNA. Serum will be collected weekly for six 
weeks, then every other week for an additional 6 weeks from inoculated animals. In 
addition to RT-PCR for detection of viral RNA (see Fig. 2B), alanine aminotransferase 
(ALT) levels will be measured as an indicator of liver injury and to assess liver histology 
in punch biopsies taken at the time of ALT elevation. Maximum viremia and an acute 
phase ALT response is expected to occur around 14-28 days post- inoculation of 
infectious RNA (Simons et ai, 1995; Schlauder et al, 1995; Karayiannis et aL, 1989). 
Transfections will be considered to have failed to give rise to infectious virus if RNA is 
not detected in the serum within 12 weeks of inoculation. Successfully infected animals 
will be followed with twice weekly bleeds until resolution of the viremia, or for 6 
months, whichever is longest. 



EXAMPLE 6 
Construction of GBV-B/HCV Chimeras 
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The GBV-B genome can be used as the acceptor molecule in the construction of 
chimeric viral RNAs containing sequences of both HCV and GBV-B. Such constructs 
will allow one to investigate the mechanisms for the different biological properties of 
these viruses and to discover and investigate potential inhibitors of specific HCV 
activities (e.g., proteinase) required for HCV replication. Different classes of chimeric 
viruses are contemplated. These include: (a) replacement of the GBV-B IRES with that 
of HCV; and (b) replacement of the NS3 major serine proteinase and helicase, and (c) the 
replacement of the NS5B RNA-dependent RNA polymerase with the homologous 
proteins of HCV. 

The chimeric constructs described in the following sections will be made by PCR 
mutagenesis, using high fidelity polymerases and oligonucleotide primers designed to 
include the specific fusions of GBV-B and HCV sequences (Landt et al. 9 1990). First 
round PCR reactions will create the desired fusion, and generate a new "primer" to be 
used in a second PCR reaction spanning the region to a convenient unique restriction site. 
PCR cycles will be kept to the minimum number necessary for successful amplification, 
and all segments of viral sequence that are amplified by PCR will be subjected to DNA 
sequencing to exclude the presence of unwanted PCR-introduced errors. Sequencing will 
be accomplished at UTMB's core Recombinant DNA Laboratory. Amplified segments 
will be kept to the minimum by the exchange of cloned cDNA segments spanning 
convenient restriction sites in subgenomic clones, and where necessary PCR artifacts can 
be corrected by site-directed mutagenesis (QuickChange mutagenesis kit, Stratagene). 

A number of viable positive-strand RNA virus chimeras have been constructed 
previously in which IRES elements have been swapped between different viruses. Most 
of these chimeras have involved the exchange of IRES elements between picornaviruses. 
Others have been successful in constructing viable poliovirus chimeras containing the 
HCV IRES in place of the native poliovirus IRES (Zhao et ai 9 1999; Lu and Wimmer, 
1996). A similar rhino virus 14 chimera containing the HCV IRES has been constructed, 
although its replication phenotype is not as robust as the poliovirus chimera described by 
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Lu and Wimmer (Lu and Wimmer, 1996). More importantly, Frolov et al. (Frolov et al, 
1998) recently reported chimeric flaviviruses in which the HCV IRES was inserted into 
the genetic background of a pestivirus, bovine viral diarrhea virus (BVDV) in lieu of the 
homologous BVDV sequence. Although these viable chimeric polioviruses and 
5 pestiviruses replicate in cell cultures, they are poor surrogates for HCV in animal models 
as neither virus is hepatotropic or causes liver disease. Importantly, Frolov et al. (Frolov 
et al, 1998) demonstrated quite convincingly that the requirement for cis-acting 
replication signals at the 5' terminus of the pestivirus genome was limited to a short 
tetranucleotide sequence. This requirement presumably reflects the need for the 
10 complement of this sequence at the 3' end of the negative strand during initiation of 
positive-strand RNA synthesis. The work of Frolov et al shows that the IRES of BVDV 
does not contain necessary replication signals, or that if these are present within the 
BVDV IRES they can be complemented with similar signals in either the HCV or 
encephalomyocarditis virus (EMCV, a picornavirus) IRES sequence. Since GBV-B and 
15 HCV are more closely related to each other than BVDV and HCV, these studies provide 
strong support for the viability of chimeras containing the HCV IRES in the background 
of GBV-B. 

|i Construction of a viable IRES chimera will be enhanced by studies that have 

20 documented the sequence requirements and secondary structures of the IRES elements of 
both HCV and GBV-B (see Lemon and Honda, 1997; Honda et al, 1996; Rijnbrand et 
al, 1999). To a considerable extent, the work of Frolov et al. (Frolov et al, 1998) was 
guided by studies of the HCV IRES structure. More recently, these studies have been 
extended to include a detailed mutational analysis of the GBV-B IRES. The results of 
25 these experiments indicate that the functional IRES of GBV-B extends from the 5' end of 
structural domain II (nt 62) to the initiator AUG codon (nt 446). This segment of the full- 
length GBV-B clone will be replaced with HCV sequence extending from 5' end of the 
analogous domain II within the HCV IRES (nt 42) to the initiator codon at the 5' end of 
the HCV open reading frame (nt 341) to construct the candidate chimera, "GB/C:IRES". 
30 The source of HCV cDNA for these studies will be the infectious HCV clone, pCV- 
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H77C, which contains the sequence of the genotype la Hutchinson strain virus (Yanagi et 
al, 1998), whose infectivity in a chimpanzee following intrahepatic inoculation with 
synthetic RNA transcribed from pCV-H77C has been confirmed. 

This GB/C:IRES construct will retain two upstream hairpins within the GBV-B 
sequence (stem-loops la and lb), and it is thus analogous to the viable 
"BVDV+HCVdelB2B3Hl" chimera of Frolov et al (Frolov et al, 1998). A second 
chimera can be constructed in which the entire HCV 5' nontranslated RNA will be^ 
inserted in lieu of nts 62-446 of the GBV-B genome ("GB/C:5 V NTR"). This construct 
will add to the inserted HCV sequence the most 5' stem-loop from HCV (stem- loop I). A 
similar insertion was shown to substantially increase the replication capacity of 
BVDV+HCVdelB2B3Hl by Frolov et al (Frolov et al, 1998), providing a replication 
phenotype similar to wild-type BVDV in cell culture. 

It is important to point out that there is strong evidence from multiple lines of 
investigation indicating that it will not be necessary to include coding sequence in these 
IRES chimeras. This is the case even though Reynolds et al. (Reynolds et al, 1995) have 
argued that the HCV IRES extends past the initiator codon, and into the core-coding 
region of that virus. Although Lu and Wimmer (Lu and Wimmer, 1996) found it 
necessary to include HCV core sequence to obtain a viable chimeric poliovirus, the 
BVDV chimeras reported by Frolov et al (Frolov et al, 1998) did not contain any HCV 
coding sequence. This discrepancy may be explained by the observation that the only 
downstream requirement for full activity of both the GBV-B and HCV IRES elements is 
the presence of an unstructured RNA segment (Honda et al, 1996; Rijnbrand et al, 
1999). Presumably, this facilitates interaction of the viral RNA with the 40S ribosome 
subunit in the early steps of cap-independent translation (Honda et al, 1996). The 5' 
GBV-B coding sequence fulfills this criterion (Rijnbrand et al, 1999). 
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EXAMPLE 7 

In vitro Characterization of the Translational Activity of IRES Chimeras 

The fidelity of the genome-length chimeric constructs will be confirmed by 
sequencing any DNA segments that have been subjected to PCR during the construction 
process, as well as confirming sequence at the junction sites. In addition, the translational 
activity of synthetic RNA derived from these constructs will be assessed and compared to 
the translational activity of the wild-type GBV-B and HCV RNAs. These studies will be 
carried out in a cell-free translation assay utilizing rabbit reticulocyte lysates (^jnbrand 
et al 9 1999). Synthetic RNA will be produced by runoff T7 RNA polymerase 
transcription using as template Clal-digested plasmid DNA (BamHI digestion in the case 
of the genome HCV construct) (T7 Megascript kit, Ambion). 3 H-UTP will be added to 
the reaction mix to allow for quantification of the RNA product. Reticulocyte lysates 
(Promega) will be programmed for translation by the addition of RNA (at least 50% full- 
length as determined by agarose gel electrophoresis) at 20, 40 and 80 |ig/ml, and 
translation reactions will be supplemented with microsomal membranes (Promega). 35 S- 
Methionine-labelled translation products will be separated by SDS-PAGE, and the 
quantity of El protein produced from each RNA determined by Phosphorlmager analysis 
(Molecular Dynamics). Comparisons of the activity of the HCV IRES in the background 
of GBV-B and HCV will take into account differences in the methionine content of the 
El proteins of these viruses. Based on previous studies of both the GBV-B and HCV 
IRES elements (Honda et al, 1996; Rijnbrand et ai, 1999), it is expected that these 
studies will confirm that the HCV IRES will retain nearly full activity when placed 
within the GBV-B background. 

EXAMPLE 8 
In vivo Characterization of IRES Chimeras 

Synthetic RNAs produced from each of the two chimeric GBV-B/HCV constructs 
(GB/C:IRES and GB/C:5'NTR) will be tested for their ability to induce infection and 
cause liver disease in susceptible tamarins. These experiments will be carried out as 
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described in Example 2. GB/C:5'NTR may generate viremia and liver injury more 
closely resembling that observed with wild-type GBV-B infection (Frolov et al. 9 1998). 

EXAMPLE 9 

5 Chimeric Flaviviruses Containing the HCV NS3 Serine Proteinase/Helicase 

within the GBV-B Background 

Chimeric flaviviruses containing the HCV NS3 serine proteinase/helicase within 
the GBV-B background are also contemplated within the ^present invention. The 
construction of chimeric flaviviruses containing specific heterologous functional 
polyprotein domains, however, poses a number of special problems. Unlike the situation 
with the IRES, where the relevant RNA segments appear to have a unique function 
restricted to cap-independent translation initiation and interact with host cell 
macromolecules, viral proteins often have multiple functions and may form specific 
macromolecular complexes with other viral proteins that are essential for virus replication 
(Lindenbach and Rice, 1999). Furthermore, such chimeric polyproteins must be 
amenable to efficient processing by the viral proteinases (NS2/NS3 or NS3). This 
requires knowledge of the proteinase cleavage specificities as well as specific sites of 
proteolytic cleavage. Although to date there have been no published studies of the 
processing of the GBV-B polyprotein, the relatively close relationship between GBV-B 
and HCV, about 30% overall amino acid identity within the polyprotein (Muerhoff et ai, 
1995), allows good computer predictions of the alignments of these proteins. The 
crystallographic structures of both the proteinase and helicase domains of the HCV NS3 
protein have been solved (Yao et al, 1997). Thus, both linear alignments and models of 
the 3D structure of the NS3 proteins of these viruses can provide guidelines for designing 
specific chimeric fusions that are likely to preserve function. 
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EXAMPLE 10 
NS3 Proteinase-Domain Chimeras 

In HCV, NS3 contains the major serine proteinase that is responsible for most 
30 cleavage events in the processing of the nonstructural proteins, i.e., those that occur at the 
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NS3/4A, 4A/4B, 4B/5 A and 5A/5B junctions. The active proteinase domain of HCV is 
located within the amino terminal third of the NS3 protein (residues 1-181), which shares 
31% amino acid identity with the analogous segment of the GBV-B polyprotein (GBV-B 
vs HCV-BK) (Muerhoff et aL, 1995). Importantly, the active site of this proteinase 
5 appears to be particularly well conserved in GBV-B. The GBV-B proteinase maintains 
the residues that are responsible for catalysis and zinc binding in the HCV enzyme 
(Muerhoff et aL, 1995), and unlike the NS3 proteinases of some other flaviviruses 
preserves the Phe-154 residue that determines in_ part the Sj specificity pocket of the^ 
enzyme and the preference of the HCV proteinase for substrates with a cysteine residue at 
10 the PI position (Scarselli et aL, 1997). Thus, it is not surprising that the relevant 
proteolytic cleavage sites within the GBV-B polyprotein that are predicted from 
fg alignments with the HCV polyprotein all possess a Cys residue at this position. Of 

z% greatest significance for the proposed experiments, however, is the work of Scarselli et al. 

03 (Scarselli et aL, 1997) who demonstrated that the GBV-B NS3 proteinase is able to 

en 15 effectively process the polyprotein of HCV in experiments carried out in vitro. Using 
synthetic peptide substrates, these investigators demonstrated that the enzymatic activities 
of the GBV-B proteinase (residues 1-181) had kinetic parameters similar to the HCV 
proteinase on NS4A/4B and NS4B/4A substrates HCV. They did not possess reagents 
allowing a determination of whether the HCV proteinase is able to cleave a GBV-B 
20 substrate, but their results indicate that these viral proteinases share important functional 
properties. Therefore, these similarities suggest that the HCV proteinase could function 
in lieu of the GBV-B proteinase if used to replace this segment of an infectious GBV-B 
clone. In addition, experiments with sindbis/HCV chimeras have shown that the HCV 
proteinase can cleave within the framework of a sindbis polyprotein (Filocamo et aL, 
25 1997). 

In considering the design of these NS3 proteinase chimeras, there are two 
additional important considerations. First, in HCV, the cleavage between NS2 and NS3 
occurs in cis, as the result of a zinc-dependent metalloproteinase that spans the NS2/NS3 
30 junction (Hijikata et aL, 1993). As only the NS3 sequences will initially be exchanged, 
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the viability of the resulting chimeras will be dependent upon preservation of the cis- 
active cleavage across a chimeric NS2/NS3 proteinase domain. The alignment of GBV-B 
and HCV sequences shows that residues in HCV that have been shown by Grakoui et ah, 
1993, to be essential for the NS2/NS3 cleavage are conserved in GBV-B (Muerhoff et al f 
5 1995). Additional chimeras that will include the relevant carboxyl-terminal portion of 
NS2 can also be created. 

A second important consideration is that the mature HCV NS3 proteinase 
functions as a noncovalent assembly of the NS3 proteinase domain and the amino 

10 terminal portion of NS4A, a proteinase accessory factor. The details of this association 
are well known, and have been studied at the crystallographic level (Kim et aL 9 1996). 
The N-terminal domain of the folded proteinase contains eight P strands, including one 
contributed directly by the NS4A peptide backbone. X-ray studies have shown that this 
array of p strands gives rise to a much more ordered N-terminus. Thus, the presence of 

15 the NS4A strand seems likely to contribute to the structure of the substrate-binding 
pocket. It is not known whether the NS3 proteinase of GBV-B also requires a similar* 
interaction with NS4A of that virus for complete activity, or, if so, whether the NS4A of 
GBV-B could substitute for NS4A of HCV in forming the fully active NS3 proteinase of 
HCV. The predicted GBV-B NS4A molecule is 54 amino acid residues in length 

20 (Simons, et al, 1995; Muerhoff et aL 9 1995), just as in HCV. However, the level of 
amino acid homology between the NS4A molecules is not especially high, and the 
potential interaction with either NS3 molecule cannot be predicted from this sequence on 
the basis of available knowledge. To overcome this potential problem, chimeras will be 
created in which not only the NS3 proteinase domain of GBV-B is replaced, but also the 

25 relevant NS4A segment as well, with homologous segments of the HCV polyprotein. 
The interaction of the HCV NS3 and NS4A domains represents a unique target for 
antiviral drug design, and it would be beneficial to have this specific interaction present 
in any virus to be used as a surrogate for HCV in the evaluation of candidate antiviral 
inhibitors of HCV proteinase in vivo. 

30 
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The NS3 proteinase chimeras that can be made include "GB/C:NS3 P " which will 
contain the sequence encoding the first 181 amino acid residues of the HCV NS3 
molecule in lieu of that encoding the first 181 residues of GBV-B NS3, and 
"GB/C:NS3 P 4A", which will include the same NS3 substitution as well as the HCV 
5 sequence encoding the amino-terminal segment of NS4A that forms the interaction with 
NS3. The precise NS4A sequence to be included in the latter chimera will be based on 
the modeling studies, which may also suggest more effective fusions of the NS3 
proteinase domain of HCV with the downstream NS3_ helicase domain of GBV-B. The 
source of HCV cDNA for these experiments will be the infectious HCV clone, pCV- 
10 H77C, which contains the sequence of the genotype la Hutchinson strain virus (Yanagi et 
aU 1998). 

EXAMPLE 11 
NS3 Helicase Domain Chimeras 

15 In addition to serine proteinase activity located within the amino-third of NS3, the 

downstream carboxy-terminal two-thirds of the molecule contains an RNA helicase 
activity. These two functional domains appear to be separated by a flexible spacer, 
within which the fusion of HCV proteinase or helicase domain sequences with GBV-B 
sequence will be placed. The exact role of the helicase in the HCV life-cycle is not 

20 known, but it is almost certainly required for dsRNA strand-separation during some phase 
of viral RNA synthesis. The helicase domains of GBV-B and HCV are remarkably well 
conserved, with some regions within the helicase showing as much as 55% amino acid 
identity (Muerhoff et ai, 1995). The GBV-B helicase is more closely related to the HCV 
helicase than all other flaviviral NS3 helicases, and it preserves many residues found 

25 within the conserved helicase motifs of HCV. Thus the HCV NS3 helicase may be 
capable of functioning when placed within the polyprotein of GBV-B, and such a 
chimeric virus may be capable of replication. Residues 182-620 of the GBV-B NS3 
molecule will be substituted with the analogous segment of HCV (Fig. 5, "GB/C:NS3 h "). 
A chimera will also be made in which the entire NS3 and amino terminal NS4 protein 

30 sequences of GBV-B is replaced with the homologous HCV sequences ("GB/C:NS3- 
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4A"). The latter construct will thus represent a dual proteinase-helicase chimera. As 
with the proteinase chimeras, the HCV cDNA will be derived from pCV-H77C (Yanagi 
etal, 1998). 

EXAMPLE 12 
In vitro Characterization of NS3 and NS3-NS4A Chimera 

Prior to being evaluated for infectivity in susceptible tamarins, RNAs produced in 
vitro from these clones will characterized in vitro. This evaluation wilj_b^res1xicted to a 
documentation of the proper processing of the expressed polyprotein {i.e., NS2/NS3 and 
NS3 proteinase functions), since there are no relevant assays that can determine whether 
the helicase or RNA-dependent RNA polymerase activities in these polyproteins are 
sufficient for virus replication. The proteolytic processing of the polyprotein is 
important, however, as it may be altered either by inclusion of the heterologous HCV 
NS3 proteinase in lieu of the natural GBV-B protease, or by a change in the folding of the 
polyprotein induced by inclusion of HCV sequence anywhere within the polyprotein. 
These studies will be carried out in cell-free coupled transcription/translation assays 
("TnT" system, Promega) supplemented with microsomal membranes. Template DNAs 
will be digested with Sail, which restricts the cDNA within the NS5B coding region. 35 S- 
methionine-labelled translation products will be separated by SDS-PAGE, and the mature 
NS3 protein identified by its apparent molecular mass. The NS3 and NS5B proteins will 
be identified by immunoblot analysis using rabbit antisera to the GBV-B NS3 and NS5B 
proteins. Generation of a mature -68 kDa NS3 protein will provide proof of both the cis- 
active NS2/NS3 cleavage and the NS3-mediated cleavage of NS3/NS4A. Similarly, 
identification of a mature, processed NS5B molecule will provide further support for the 
activity of the NS3 proteinase. Controls for these experiments will be the wild-type 
GBV-B polyprotein expressed in similar fashion from the full-length GBV-B clone. If 
necessary to more clearly demonstrate the processing of the nonstructural proteins in 
these constructs, subclones representing the nonstructural region of the chimeric 
sequences could be produced. 
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EXAMPLE 13 
In vivo Characterization of NS3 and NS3-4A Chimeras 

Synthetic RNAs produced from each of the chimeric GBV-B/HCV constructs 
described in the preceding section will be tested for their ability to induce infection and 
cause liver disease in susceptible tamarins. These experiments will be carried out using 
the approach described above. 

EXAMPLE 14 

Chimeric Flaviviruses Containing the HCV NS5B RNA Dependent 
RNA Replicase within the GBV-B Background 

The HCV NS5B molecule contains an RNA-dependent RNA polymerase that 
plays a central role in replication of the virus. Although this molecule represents a prime 
target for drug discovery efforts, it has proven difficult to express NS5B in a form that 
retains enzymatic activity specific for HCV RNA as a substrate. Thus, relatively little is 
known of the functional activity of the HCV replicase, including structure-function 
relationships of NS5B. Despite this, the NS5B proteins of GBV-B and HCV appear to be 
functionally closely related, as they share as much as 43% amino acid identity (Muerhoff 
et al. y 1995). A more important question may be whether an RNA dependent RNA 
polymerase can act on foreign substrates. However, published work has shown that in 
vitro purified HCV polymerase has very little specificity for its template, using hepatitis 
C or globin message with equal fidelity (Behrens et al, 1996; Al et al, 1998). This 
finding is very similar to that obtained with picornaviral polymerases, where it has been 
known for many years that in vitro the enzyme exhibits very little specificity. It has 
always been considered highly likely that this situation would not pertain in vivo where it 
was thought that the interaction of viral or cellular factors with the 3' end of the genome 
would generate template specificity. However, recent reports have shown that the 
removal of the entire 3' untranslated sequence (leaving, however, the poly(A) region 
present) from both the poliovirus and rhinovirus genome does not completely abrogate 
the infectivity of the virus (Todd et al, 1997). Furthermore, virus, which was recovered 
after the initial transfections, was shown to have recovered much of the infectivity of the 
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original virus (Todd et ah, 1997). The mechanism for this recovery of infectivity is at 
present unknown, but these results suggest that the HCV polymerase may be able to 
function to replicate infectious GBV-B/HCV NS5B chimeras. 

Thus a chimeric genome-length virus can be created in which the NS5B coding 
sequence of HCV (amino acids 2422-3014, 593 residues) is inserted within the 
background of GBV-B in lieu of its native RNA-dependent RNA polymerase (amino 
jicids 2274-2864, ,591 residues). TWschimeric virus wouldbe valuable for animal studies 
of candidate antiviral inhibitors of HCV RNA synthesis. 

This NS5B chimera would be evaluated to determine that there was proper 
proteolytic processing of the polyprotein. This would be accomplished by expression of 
the chimeric polyprotein in a coupled translation-transcription reaction, followed by 
immunoblot analysis for the mature NS5B protein, as described for the NS3 and NS3-4A 
chimeras in the preceding section. If these results confirm that the GB/C:5B chimeric 
polyprotein is processed with release of NS5B, experiments in tamarins would progress 
to determine whether synthetic RNA transcribed from the clone is infectious and capable 
of causing liver disease in intrahepatically inoculated animals. These experiments would 
be carried out as described above. 

A chimeric molecule can be constructed from an infectious GBV-B clone in 
which the HCV NS3 proteinase or proteinase/helicase sequence would be placed in frame 
in lieu of the homologous GBV-B sequence, and this chimeric cDNA would be used to 
generate infectious GBV-B/HCV chimeric viruses by intrahepatic inoculation of 
synthetic RNA in tamarins. Published studies indicate that the GBV-B and HCV 
proteinases have closely related substrate recognition and cleavage properties, likely 
making such chimeras viable and capable of initiating viral replication in appropriate cell 
types. 
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EXAMPLE 15 

Chimeric Viruses Containing HCV Structural Proteins within a GBV-B Genetic 
Background, and GBV-B Structural Proteins within an HCV Background 

It is well documented that the structural proteins of one flavivirus may in some 
cases be substituted for those from another member of the family. Such chimeric viruses 
have been recovered from viruses as distantly related to each other as dengue virus and 
tick-borne encephalitis virus (Pletnev et aL 9 1992). More recently, the prM and E proteins 
of Japanese encephalitis virus have been used to replace the_ equivalent proteins in _a 
vaccine strain of yellow fever virus to produce a JE/YF chimera (Chambers et al 9 1999). 
These observations suggest that chimeras in which the structural proteins of HCV have 
been used to replace the homologous proteins of GBV-B may well be viable and capable 
of replication. The isolation of a chimeric virus containing HCV structural proteins, but 
having the growth characteristics of GBV-B virus, could answer many fundamental 
questions concerning the structure and interaction of these proteins in HCV. They would 
also be useful in addressing the nature of the immune response to HCV structural proteins 
in infected primates (Farci et al. f 1992). More to the point of this application, the 
availability of such chimeric viruses would allow studies of candidate HCV vaccines to 
be carried out in the tamarin model. This would be a major advance, because at present 
such studies are limited to chimpanzees (Choo et ah, 1994). 

The basis for the difference in the host ranges of HCV and GBV-B is completely 
unknown. Among many other possibilities, it is conceivable that the host range is 
dependent upon the availability of a specific receptor(s). If this were the case, host range 
might be dependent upon the envelope proteins that must interact with the putative 
cellular receptor. Thus, a chimeric virus containing the envelope proteins of HCV within 
the genetic background of GBV-B might be noninfectious in tamarins (but potentially 
infectious in chimpanzees). Thus, a finding that both structural protein chimeras are 
noninfectious in the tamarin, may require the construction of complementary chimeras in 
which the relevant GBV-B structural proteins will be inserted into the background of an 
infectious HCV clone. If inclusion of the GBV-B envelope proteins within the backbone 
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of HCV confers on the resulting chimera the ability to replicate in tamarins, it will 
confirm an important role for the structural proteins in defining the different host ranges 
of these viruses. More importantly, the resulting virus would be an exceptionally valuable 
resource for future studies as it would contain all of the nonstructural replication 
elements, as well as the 5' and 3' nontranslated regions, of HCV. Such a virus would 
allow the tamarin model to be used to address many unresolved issues in HCV biology 
and pathogenesis. 



EXAMPLE 16 

Construction and Evaluation of Structural Protein Chimeras 

In designing structural protein chimeras, it is important to note that the two 
envelope proteins of HCV, El and E2, form noncovalent heterodimeric complexes that 
are likely to be important in the assembly of infectious virus particles. This is not known 
to be the case with the envelope proteins of GBV-B, but it is likely given similarities in 
the sizes and hydropathy profiles of these proteins (Simons et aL, 1995; Muerhoff et aL, 
1995). Accordingly, the El and E2 proteins will be replaced as a unit, and chimeras 
containing only one of these proteins from the heterologous virus will generally not be 
produced. First, a chimera will be created where the El and E2 regions of GBV-B virus 
are replaced with those of HCV, "GB/C:El-2". The source of HCV cDNA for these 
constructions will be pCV-H77C (Yanagi et aL, 1998). A chimera will also be made in 
which the core protein, in addition to the envelope proteins, is replaced with the 
homologous proteins of HCV ("GB/C:Co-E2"). Additional chimeras will be made to 
determine whether tamarins can be infected with chimeras containing the GBV-B 
structural proteins within the genetic background of HCV. These will include "C/GB:E1- 
2" and "C/GB:Co-E2". The backbone for these chimeras will be pCV-H77C, the 
infectious genotype la cDNA clone developed in the Purcell laboratory at NIAID 
(Yanagi etaL, 1998). 

The specific amino acid sequences of GBV-B to be replaced with the homologous 
segments of HCV have been determined by alignments of the GBV-B and HCV 
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sequences, coupled with the location of signalase cleavage sites predicted to be present 
within the amino terminal third of the GBV-B polyprotein using the computer algorithm 
of Von Heijne. These predicted signalase cleavages lie between residues 156/157 
(core/El), aa 348-349 (E1/E2) and 732/733 (E2/NS2) in the GBV-B sequence. Thus, the 
chimera GB/C:El-2 will contain sequence encoding HCV aa 192-809 in lieu of that 
encoding aa 157-732 in GBV-B, while the insertion in the GB/C:Co-E2 chimera will 
extend from the initiator AUG codon (aa 1) to residue 809 in HCV, and will be spliced 
into GBV-B in lieu of the segment encoding aa l-732 _ in the GBV-B clone. The 
complementary chimeras to be constructed within the background of HCV will involve 
exchanges of the same segments of the genomes. 

EXAMPLE 17 
Characterization of Structural Protein Chimeras 

Prior to being evaluated for infectivity in tamarins, the processing of these 
chimeric polyproteins will be examined in coupled transcription/translation reactions 
supplemented with microsomal membranes, as described in the preceding sections for the 
proteinase and other proposed chimeras. If these results confirm that the polyprotein is 
processed as expected, with production of glycosylated El and E2 proteins from each of 
the chimeras (and seen in similar experiments with HCV proteins), studies would proceed 
in tamarins as previously described. The results of these studies may provide novel 
information on the basis of the host range differences that exist between HCV and GBV- 
B. If these results suggest that the envelope proteins play a critical role in determining 
host range, additional experiments could be carried out with these chimeras in 
chimpanzees (which are permissive for HCV but apparently not for GBV-B). 

EXAMPLE 18 
Further Characterization of Rescued Chimeric Viruses 

Where infection with chimeric viruses is induced in animals that are injected 
within the liver with synthetic RNA, this virus will be passaged in GBV-B naive tamarins 
to further characterize the nature of the infection induced by the chimera. This will be 
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accomplished by taking a pool of the 3 highest titer GBV-B RNA-containing serum 
specimens from the animal that was successfully transfected with RNA, and inoculating 1 
mL of a 1:100 dilution of this pool intravenously into two susceptible animals. These 
animals will be monitored for infection and liver disease. These animals will be followed 
until resolution of the viremia and appearance of antibodies detectable in immunoblots 
with GST-NS3 protein expressed in E. coli, or for at least 6 months should an animal 
sustain a chronic infection. RT-PCR amplification of chimeric segments of the genome 
may be employed to determine whether the altered phenotype results from mutations 
within the heterologous portion of the genome. 

EXAMPLE 19 
Use of GBV-B as Model for HCV 

GBV-B and/or GBV-B/HCV chimeras can be used as a model for HCV. Such 
studies will allow one to investigate the mechanisms for the different biological 
properties of these viruses and to discover and investigate potential inhibitors of specific 
HCV activities (e.g., proteinase) required for HCV replication. 

EXAMPLE 20 

Use of GBV-B/HCV Chimeras to Test Candidate HCV NS3 
Proteinase Inhibitors or Other Inhibitors of HCV 

GBV-B/HCV viruses can be used in preclinical testing of candidate HCV NS3 

proteinase inhibitors or other inhibitors of HCV. 

1. Candidate Substances 

As used herein the term "candidate substance" refers to any molecule that is 
capable of modulating HCV NS3 proteinase activity or any other activity related to HCV 
infection. The candidate substance may be a protein or fragment thereof, a small 
molecule inhibitor, or even a nucleic acid molecule. It may prove to be the case that the 
most useful pharmacological compounds for identification through application of the 
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screening assay will be compounds that are structurally related to other known 
modulators of HCV NS3 proteinase activity. The active compounds may include 
fragments or parts of naturally-occurring compounds or may be only found as active 
combinations of known compounds that are otherwise inactive. However, prior to testing 
5 of such compounds in humans or animal models, it will be necessary to test a variety of 
candidates to determine which ones have potential. 

Accordingly, the active compounds may include fragments or parts of naturally- 
occurring compounds or may be found as active combinations of known compounds that 
are otherwise inactive. As such, the present invention provides screening assays to 
identify agents that are capable of inhibiting proteinase activity in a cell infected with 
chimeric GBV-B/HCV viruses containing the HCV proteinase. It is proposed that 
compounds isolated from natural sources, such as animals, bacteria, fungi, plant sources, 
including leaves and bark, and marine samples may be assayed as candidates for the 
presence of potentially useful pharmaceutical agents. It will be understood that the 
pharmaceutical agents to be screened could also be derived or synthesized from chemical 
compositions or man-made compounds. Thus, it is understood that the candidate 
substance identified by the present invention may be polypeptide, polynucleotide, small 
molecule inhibitors or any other compounds that may be designed through rational drug 
design starting from known inhibitors of proteinases or from structural studies of the 
HCV proteinase. 

The candidate screening assays are simple to set up and perform. Thus, in 
assaying for a candidate substance, after obtaining a chimeric GBV-B/HCV virus with 
25 infectious properties, a candidate substance can be incubated with cells infected with the 
virus, under conditions that would allow measurable changes in infection by the virus to 
occur. In this fashion, one can measure the ability of the candidate substance to prevent 
or inhibit viral replication, in relationship to the replication ability of the virus in the 
absence of the candidate substance. In this fashion, the ability of the candidate inhibitory 
30 substance to reduce, abolish, or otherwise diminish viral infection may be determined. 
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"Effective amounts" in certain circumstances are those amounts effective to 
reproducibly reduce infection by the virus in comparison to the normal infection level. 
Compounds that achieve significant appropriate changes in activity will be used. 
Candidate compounds can be administered by any of a wide variety of routes, such as 
intravenously, intraperitoneally, intramuscularly, orally, or any other route typically 
employed. 

It will, of course, be understood that all the screening methods of the present 
invention are useful in themselves notwithstanding the fact that effective candidates may 
not be found. The invention provides methods of screening for such candidates, not 
solely methods of finding them. 

2. In vitro Assays 

In one particular embodiment, the invention encompasses in vitro screening of 
candidate substances. Using a cell line that can propagate GBV-B in culture, in vitro 
screening can be used such that GBV-B or HCV virus production or some indicator of 
viremia is monitored in the presence of candidate compounds. A comparison between the 
absence and presence of the candidate can identify compounds with possible preventative 
and therapeutic value. 

3. In Vivo Assays 

The present invention also encompasses the use of various animal models to test 
for the ability of candidate substances to inhibit infection by HCV. This form of testing 
may be done in tamarins. 

The assays previously described could be extended to whole animal studies in 
which the chimeric virus could be used to infect a GBV-B permissive primate, such as a 
tamarin. One would then look for suppression of viral replication in the animal, and a 
possible impact on liver disease related to replication of the infectious chimeric virus. 
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The advantage of this in vivo assay over present available assays utilizing HCV infection 
in chimpanzees is the reduced cost and greater availability of GBV-B permissive 
nonhuman primate species. 

Treatment of animals with test compounds will involve the administration of the 
compound, in an appropriate form, to the animal. Administration will be by any route 
that could be utilized for clinical or non-clinical purposes, including but not limited to 
oral, nasal, buccal, rectal, vaginal or topical. Alternatively, administration may be by 
intratracheal instillation, bronchial instillation, intradermal, subcutaneous, intramuscular, 
intraperitoneal or intravenous injection. Specifically contemplated are systemic 
intravenous injection, intraperitoneal injection, and oral administration. 

Determining the effectiveness of a compound in vivo may involve a variety of 
different criteria. Such criteria include, but are not limited to, survival, reduction of rate 
of infection, arrest or slowing of infection, elimination of infection, increased activity 
level, improvement in liver function, and improved food intake. 



Infectious GBV-B/HCV chimeras expressing HCV envelope proteins will have 
utility as a vaccine immunogen for hepatitis C. Such clones clearly have the potential to 
be constructed as chimeras including relevant hepatitis C virus sequences in lieu of the 
homologous GBV-B sequence, providing unique tools for drug discovery efforts. 

Chimeric viruses containing the envelope proteins of hepatitis C virus (as 
described in the attached) would confer the antigenic characteristics of hepatitis C virus 
on the chimera. These chimeras may have the ability to replicate in chimpanzees (and 
thus humans) by virtue of the fact that the chimeric envelope is now able to interact with 
the human hepatocyte cell surface, a necessary first step in virus replication. Therefore, 
the chimeric virus, while able to infect and replicate in humans, may not cause much or 



EXAMPLE 21 



Use of Infectious GBV-B/HCV Chimeras as Vaccines 
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any disease-the reasoning here is that the genetic backbone of the chimera that encodes 
the nonstructural proteins of GBV-B has not evolved for replication in human cells and 
thus may not replicate well. Thus, the chimera may have limited replication ability, cause 
no disease, but still elicit immunity to the surface envelope proteins of HCV and thus 
5 have potential as a hepatitis C vaccine. These chimeras can be tested for their ability to 
promote immunity to HCV through an immune response. 

EXAMPLE 22 
Construction of a GBV-B Infectious Clone 

A genome-length cDNA copy of the complete GB virus B (GBV-B) genome 
sequence, including the novel 3' terminal sequence, was assembled from fragments 
amplified by reverse transcription-polymerase chain reaction (RT-PCR) from viral RNA 
present in a 0.2 (il aliquot of infectious serum (GB agent pool mystrax 666, 8/93) 
supplied by Dr. Jens Bukh of the National Institutes of Health. The serum sample was 
diluted with 100 jal of fetal calf serum and extracted using the Trizol system 
(GIBCO/BRL). The pellet was dissolved in 10 mM dithiothreotol containing 20 u/mL 
RNasin (Promega). The selection of primers for cDNA synthesis and PCR amplification 
was based on the published sequence of GBV-B (Simons et al., 1995). RT/PCR was 
carried out using Superscript Reverse Transcriptase (GIBCO/BRL) and the Advantage 
cDNA PCR Kit (Clontech). Four subgenomic regions were amplified covering the entire 
published GBV-B genome sequence. A fifth subgenomic region included the novel 3' 
terminal sequence that was identified in our laboratory. The oligonucleotide primer sets 
(listed 5 »3') used for amplification of the individual regions included: 

25 Primer 1 : 

CGGGATCCCGTAATACGACTCACTATAGACCACAAACACTCCAGTTTG 

(SEQ ID NO:3) 

Primer 2; GTGGAATTCACAGCGTCATA (SEQ ID NO:4) 
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(Places a T7 RNA polymerase promoter sequence immediately upstream of the 
GBV-B sequence; the amplified segment extends from the 5' terminus of the viral 
genome to the unique EcoRl site at position 1978.) 

5 Primer 3: TGTGAATTCCACTCTCCTACC (SEQIDNO:5) 
Primer 4: TTATCGATTGCAGCAACCATG (SEQIDNO:6) 

(Overlapping EcoRl site at position 1978 and the unique Clal site at 5327.) 



Primer 5: CATGGTTGCTGCAATCGATAAGCTGAAGAGTACAATAAC 

(SEQIDNO:7) 

Primer 6: GACAACAGACGCTTGACACG (SEQIDNO:8) 

(Overlapping the Cla I site at 5327 and a unique Sal I site at 7847 in the published 
sequence: however, the Sal I site was not present in the amplified GBV-B sequence, and 
was thus introduced into the cDNA using the QuikChange Site-directed Mutagenesis Kit 
(Stratagene).) 

Primer 7: CTGTCATGGGAGATGCGTAC (SEQIDNO:9) 
Primer 8: CGAGCTCGAGCACATCGCGGGGTCGTTAAGCCCGGGGTCTCC 

(SEQIDNOrlO) 

(Overlapping the Sal I site at 7847 and the published 3' end of the genome.) 

Primer 9: GACAACAGACGCTTGACACG (SEQIDNO:ll) 
Primer 10: 

CCGACTCGAGAATTCGGCCCTGCAGGCCACAACAGTCTCGCGAGTTTTTAATT 
25 CCAAGCGGGGGTTGCCCTCCGCTTGGAACAAAAACCGGGGTGCAGCCCTTGG 
TAC (SEQIDNO:12) 

(Overlaps product of primers 7 and 8 and extends to the novel 3' terminal 
sequence; includes an Xho I site at the 3' terminus of the GBV-B sequence to aid 
subsequent manipulations.) 
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RT-PCR products were gel-purified using the QIAquick Gel Extraction Kit 
(Qiagen) and ligated into a plasmid vector with the PstBlue-1 Perfectly Blunt Cloning Kit 
(Novagene). Bacterial colonies were screened for plasmid DNAs with the correct insert 
size. Those harboring an insert of the approximate correct size were subjected to 
diagnostic restriction analysis. cDNA inserts generating the appropriate restriction 
patterns were sequenced in both directions using an ABI DNA sequencer, and those with 
sequences closest to the published GBV-B sequence (Simons et_ were selected 

for subsequent assembly into the full-length clone. 

The cloned overlapping cDNA fragments were assembled into the pACNR1180 
plasmid using the unique restriction sites noted in the primer descriptions. The final, 
fully assembled plasmid bearing a 9.4 kb cDNA GBV-B insertion was subjected to DNA 
sequencing to confirm the validity of the construction and the absence of any mutations 
or errors in the sequence that may have been introduced during the assembly process. 
The sequence of the GBV-B infectious cDNA is shown in SEQ ID NO:2. 

Plasmid DNA containing the GBV-B cDNA was purified by isopycnic 
centrifiigation on a CsCl gradient and isolated by ethanol precipitation. The plasmid DNA 
was linearized at the 3' terminus of the cDNA insert by digestion with Xho I. Two jig of 
the linearized DNA was used as template in an in vitro transcription reaction using the 
Ambion T7 MEGAscript Kit according to the manufacturer's instructions. The integrity 
of the resultant synthetic RNA product was confirmed by nondenaturing agarose gel 
electrophoresis. 

Approximately 60 j^g of RNA transcribed from the GBV-B cDNA clone was 
diluted in phosphate buffered saline and injected under direct visualization at laparotomy 
into the liver of a healthy tamarin without a prior history of GBV-B infection {Saguinus 
oedipus). Serum samples were collected from the animal weekly beginning at 2 weeks 
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post-inoculation and tested for the presence of GBV-B RNA by real-time quantitative 
RT-PCR (TaqMan assay). Results are shown in the following table. 



Week Postinoculation 


Genome equivalents/ml 


0 


0 


2 


8.3 x 10 3 


3 


6.6 x 10 5 


4 


4.6 x 10 8 



5 The presence of high and increasing titers of viral RNA in the serum of this 

animal between 2-4 weeks postinoculation confirms the infectivity of the full-length 
cDNA clone containing the novel 3' terminal sequence. The presence of viremia 
demonstrated that viral replication had ensued following the delivery of the synthetic 
viral RNA to hepatocytes. 

10 

All of the compositions and methods disclosed and claimed herein can be made 
and executed without undue experimentation in light of the present disclosure. While the 

15 compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
to the compositions and methods and in the steps or in the sequence of steps of the 
method described herein without departing from the concept, spirit and scope of the 
invention. More specifically, it will be apparent that certain agents, which are both 

20 chemically and physiologically related, may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention as defined by the appended claims. 
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